2 

CGfi GTA AAT CGT TCT CGA AGC CTT AGT AAT 

arg val asn arg ser arg ser leu ser asn 
62 

TCA GCA GAT GAT GAA GTT CGA TCA ATC ATC 

ser pro asp asp glu val arg ser ile ile 
122 

TGG GTT AAC ACT GOT GOT CCA AAA GCT GCC 
trp val asn thr gly gly pro lys ala ala 

182 

GAA TCA ACA CAG GCT ATG GAT CGA AiST TGT 
q^n^ ser thr gin ala net; asp arg ser cys 

A<g§ TTC TTA CAA ACA TTA ACG GGA CGC TTA 

se^ phe leu gin thr leu -thr gly arg leu 

TTG CAG TGG GTT GTT TGC AGT GGC' AGC 
aXa leu gin trp val val cys ser gly ser 

I «, 

3|6| 

T^DC TTT TTT GAA TTA ATG GTA AAG AGC ATG 
pih phe phe glu leu net val lys ser mat 

D Cadherin 
422 I XXX cleavage xxj 

GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA 

glu ala pro arg lys ser arg phe pro glu 
482 

AGC ACG ATT GCT AGT GAT ATH GTT TCA CGA 
ser -thr lie aXa ser asp lie val ser arg 

542 

CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT 

leu asn thr ser leu ala phe phe leu asn 
602 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA 

val phe ser leu lie lys ser cys tyr lys 



32 

AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 
ser asn pro asp ile ser gly thr pro thr 

92 

GGG AGT AAG GGT TTA GAT CGC TCC AAT ycc 
gly ser lys gly leu asp arg ser asn ser 

152 

CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA 
pro trp gly ser asn pro ser pro ser ala 

212 

4 

AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 
asn arg net ser ser his thr glu thr ser 

272 

CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG 

pro thr lys lys leu phe his glu glu leu 
332 

GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG 
val arg glu ser ala leu gin gin ala trp 

392 

GTG CAC CAT TTA TAC TTT AAT GAT AAA CTT 
val his his leu tyr phe asn asp lys leu 

452 

CGT TTC ATG GAT GAC ATT GCA GCT CTT GTC 

arg phe net asp asp ile ala ala lea val 
512 

TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA 
phe gin lys asp thr gla net val glu arg 

572 

GAT CTG TTG TCT GTT ATG c^A^ AGA GGA TTT 
asp leu leu ser val met asp arg gly phe 

632 

CAG GTG TCT TCA AAG CTT TAC TCA TTA CCG 
gin val ser ser lys leu tyr ser leu pro 
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662 

AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG 
asn pro ser val leu val ser leu arg leu 

722 

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC 

his tyr val thr leu asn leu pro cys ser 
782 

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT 

ser val ser ser a1 a thx ser glu ser ser 
842 

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT 

lie ala asn met phe glu leu ser val pro 
Cadherln 

I XX EC aicrtiif xx| ' 
JGTG TTA ACA GAG CTG GCT GTC ATO TTA gac 
.^1 leu tJix glu leu ala val lie leu asp 

AAA GTC ATC AAT ATG GTA CAC a^AT TTA 
p:£ys lys val xle asn met val his asn leu 

fi022 

TtCT GAC CCT CAG ATA AAG GCT CGA GTG GCC 
i^er asp pro gin ile lys ala arg val ala 

1,5.082 

j^^TG GAA ACT GTA CCT CAG CTG TAT GAT TTT 
i'^^net glu thr val pro gin leu tyr asp phe 

'^1142 

ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT 

lie cys ile ala tAr asp asp ^yr glu ser 
1202 

GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT 

ala met ala ile ala gly thr ser val pro 
1262 

ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC 

thr ser thr ser gly arg gin his thr thr 
1322 

ATC TGT CTA CTT TOG GTT CTC AAA AAT GCA 
lie cys leu leu trp val leu lys asn ala 




692 

GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG 
asp phe leu axg ile ile cys ser his glu 

752 

TTA CTT ACT CCA CCT GCA TCT CCA TCA CCT 

leu leu thr pro pro ala ser pro ser pro 
812 

GGA TTT TCT ACG AAT GTA CAA GAC rAA AAG 

gly phe ser thr asn val gin asp gin lys 
872 

TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT 
phe arg gin gin his tyr leu ala gly leu 

932 

cct gat get gaa gga ctg TTT GGA TTG CAT 
pro asp ala glu gly leu phe gly leu his 

992 

CTC TCC AGT CAC GAC TCA GAC CCG CGG TAC 

leu ser ser his asp ser asp pro arg tyr 
1052 

ATG TTG TAT CTA CCT CTG ATT GGT ATT ATC 

met leu tyr leu pro leu ile gly ile ile 
1112 

ACA GAA ACT CAC AAT CAA CGA GGA AGA CCA 

thr glu thr his asn. gin arg gly arg pro 
1172 

GAG AGC GGA AGT JmS ATA j^GC CAG ACC GTT 

glu ser gly ser met ile ser gin thr val 
1232 

CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC 

gin leu thr arg pro gly ser phe leu leu 
1292 

TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG 
phe ser ala glu ser ser arg ser leu leu 

1352 

GAT GAA ACA GTT CTA CAG AAG TGG TTT ACA 
asp glu thr val leu gin lyk trp phe thr 
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1382 

GAT CTC TCA GTC TTG GAG CTA AAC CGG CTA 

asp leu ser val leu gin leu asn arg leu 
1442 

TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA 

phe glu nyr lys gly lys lys val phe glu 
1502 

A&A GAC ATG AGA GCA AAG CTT GAA GAA GCT 

lys asp met: arg ala lys leu glu glu ala 

1562 

ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG 

xnel: val arg arg ser arg gly gin leu glu 
1622 

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG 
gin glu asn leu arg t:rp arg lys asp last, 

■^C AAA TCA AGA GCA GAG ATT GAA CAC GAA 
i^p lys ser arg ala glu ile glu his glu 

rS742 

^^^GCA AAC CTA ATC ATT TTA GAT ACA TTA GAG 

Clala asn leu He lie leu asp thr leu gin 

l-t802 

IlirCC AAA GAG AGC ATT CTT GOT GGA GTG CTA 
i^=%er lys glu ser ile leu gly gly val leu 

i-LJ 

Gl862 

OCAA AGT GCA GTT TAT CTA CAA CAC TGT TTT 
gin ser ala val ^yr leu gin h±s cys phe 

1922 

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG 
pro gin leu leu phe glu glu glu ^hr. gin 

1982 

CGA CAC TGT AGC AGT AGC ATC GGT ACA A'PA 

arg his cys ser ser ser ile gly ^hr 
2042 

CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT 

leu net arg gin asn phe glu ile gly asn 



1412 

TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC 
leu asp lulu tyr leu cys val ser cys 

1472 

CGA ATG AAT AGC TTG ACC TTT AAG AAA TCA 

arg met asn ser leu thr phe lys lys ser 
1532 

ATT CTT GGG AGC ATA GGT GCC AGG CAA GAA 
ile leu gly ser ile gly ala arg gin glu 

1592 

AGA AGC CCA TCT GGA AGT GCC TTT GGA AGT 

arg ser pro ser gly ser ala phe gly ser 
1652 

ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT 

thr his trp arg gin asn thr glu lys leu 

1712 

GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA 
ala leu ile asp gly asn leu ala thr glu 

1772 

ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA 

ile val val gin thr val ser val thr glu 
1832 

AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC 
lys val leu leu his ser inet ala cys asn 

1892 

GCT ACA CAG AGA GCC TTG GTT TCA AAG TTT 
ala thr gin arg ala leu val ser lys phe 

1952 

CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC 
gin cys ala asp leu cys leu arg leu leu 

2012 

CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA 

arg ser his pro ser ala ser leu tyr leu 
2072 

AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA 

asn phe ala arg val lys aet gin val pro 
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2102 2132 

ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT 
mel: ser leu ser ser leu val gly thr ser gin asn phe asn glu glu phe leu arg arg 

2162 2192 

TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT 

ser leu lys tlir ile leu thr -tyr ala glu glu asp leu glu leu arg glu thr thr phe 



2222 2252 

CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA 
pro asp gin val gin asp leu val phe asn leu his met ile leu ser asp thr val lys 

2282 2312 

ATG AAG GAA CAC CAG GAG GAT CCT GA& ATG TTG ATT GAT CTA ATG tac aga att gcc aag 
net lys glu his gin glu asp pro glu met leu ile asp leu met tyr arg ile ala lys 

2342 2372 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG ACC TGG TTG CAG AAC ATG GCA GGC AAG CAC 
gly tyr gin thr ser pro asp leu arg leu thr trp leu gin asn net ala gly lys bis 

Q2.4Q2 \ xxxxxxxxxxxxxxxxxx trans raeiBbrane domain xxxxxxxxxxxxxxxxxxx 

^-BrCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT 
Sisex glu arg ser asn his ala glu ala ala gin cys leu val his • ser ala ala leu val 

%lxxyxxTXxxxxTxyT ] 2492 

I UgCT GAA TAT TTG AGC ATG CTG GAG GKC CGG AAA TAT CTT CCT GTG GGA TGT GTA ACA TTT 
^Pala glu tyr leu ser net leu glu asp arg lys tyr leu pro val gly cys val thr phe 

" 2522 2552 

I™ CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA TCT GOG GTC TCA GAT GAT GTG GTA TCT CCA 
rUgln asn ile ser ser asn val leu glu glu ser ala val ser asp asp val v^l ser pro 

W2582 2612 

t3 GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG 
C3 asp glu gin gly ile cys ser gly lys tyr phe thr glu ser gly leu val gly leu leu 



2642 

GAA CAA .GCA GCT GCT 
glu gin ala ala ala 

XX ITAM xxz| 

AAA GTA CTT ATT CCT 

lys val leu ile pro 



2672 {xxxxx ZTMi xxxxj | 
TTC TCT ATG GCT GGC ATG TAT GAA GCA GTT AAT GAA GTT TAC 
•phe ser met ala gly net tyr glu ala val asn glu val tyr 

2732 

CAT GAA GCT AAT CGG GAT GCA AAG AAA CTA TCC ACA ATT CAT 

his glu ala asn arg asp ala lys lys leu ser thr ile his 



2762 2792 

GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT CAG AGT ACT GGC TGG GAG CGG ATG 
gly lys leu gin glu ala phe ser lys ile val his gin ser thr gly trp glu arg met 
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2822 |xxxxx XTAM rxxx| 2852 

TTT GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAA 

phe gly thr tyr phe arg yal gly phe tyr gly thr lys phe gly asp leu asp glu gin 
2882 2912 

GAA TTT GTT TAG AAG GAG CCT GCA ATA ACC AAA CTT GCA GAG ATA TCT CAC AGA TTG GAG 

glu phe val tyr lys glu pro ala ile thr lys leu ala glu ile ser his arg leu glu 
2942 2972 

GGA TTT TAC GGA GAA AGA TTT GGA GAG GAT GTG GTT GAA GTA ATC AAA GAC TCT AAT CCT 
gly phe tyr gly glu arg phe gly glu asp val val glu val ile lys asp ser asn pro 

3002 3032|xxxxx ITAM rxxx| 

GTA GAC AAG TGT AAA TTA GAT CCT AAC AAG GCA TAT ATT CAG ATT ACC TAT GTG GAG CCA 

val asp lys cys lys leu asp pro asn lys ala tyr ile gin ile thr tyr val glu pro 
3062 3092 

TAC TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC ^EAT TTC GAC AAA AAT TAC AAT CTT 
tyr phe asp thr tyr glu net lys asp arg ile thr tyr phe asp lys asn tyr asn leu 

Hfl22 3152 

HGGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TTA GAT GGC CGT GCC CAT GGG GAA CTT CAT 
[grg arg phe met tyr cys tJir pro phe thr leu asp gly arg ala his gly glu leu his 

ril82 . 3212 

„pSAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT CAT GCC TTT CCT TAT ATT AAA ^CA 

Cfglu gin phe lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr 

j==:53242 3272 \ XTTnnmcn^it^^^^^^^y^^ 

[1>GG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA ACA CCA ATT GAA GTT GCT ATT GAG 
aarg val asn val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu 



r 



|XXXXX X 3 rTXXTX7CXTTyTXX XXXXXXXXX Coiled coil I ^-xmnTrTrmmtnn ^ yr KTr K-m M * i » t 
^GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA ACA CAT CAG GAT CCC GCA GAC CCC 
asp met gin lys lya thr gin glu leu ala phe ala thr his gin asp pro ala asp pro 

nnxn-^xmn^n-m-x-mnrmnn^ ] 3392 

AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC ACC ACA GTG AAT CAG GGG CCT TTG 
lys met leu gin met val leu gin gly ser val gly thr thr val asn gin gly pro leu 

34i2 3452 

GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT GAC CCA AAG CTC TTC AGA CAT CAT 

glu val ala gin val phe lea ser glu ile pro ser asp pro lys leu phe arg his his 

3482 3512 Ixxxxxxxxxxx 

AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA AGG TGT GAA GAT GCC TTA AGA AAA 
asn lys leu arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys 



1 



FIG. 1 
5 of 6 



Coiled coil 2 xxrxxxxxxxxxxxxxxrxxxxxxxxxxxx 
AAT^G AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT CAA AGG GAA TTG GGG AAA CTA TCT 

asn lys ser leu ile gly pro val gin lys glu tyr gin arg glu leu gly lys leu ser 



3602 3632 

TCG CCT TAA AGA GGC CCT ACA GCC CTA GAT CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG 

ser pro STP 

3662 ^^^^ 

TAT TGC TTG TCC CCT GCC ACA GAG ATT CCT TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC 

3722 ^'^^^ 

TCT AAA CTG AAT GCA CTT GTT TTA TTC ATC TGC AAA GAG CC& TGT ATT CAA CAT CGA GTG 

3782 3812 

TGA AAA GAT CTA TTG GAA ACC AAC ATG GAA TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG 

3842 ^^"'^ 

AAT GCA GTG GCC AAG AAA ATA TCA AAT GTA GAT TGT TAA CGC TTG AGA ATC ATG GCT ATG 

dfo TCT AAT GOT CGG GTA ACA AGC TGT TAT CTT TTA AGA CAT TTT AAT GAC TCA AAG GTA 



i^§62 3992 

die TAT ACA TTT ACC ATT ATT TAT ACC ATA GCT AAG GTT AAA AAT TTA TTC ACT TTA AGT 



±022 



^IfcG TBT TTT TTA ATT TAT ATC ACC ATT TAT AGA TTC ATT TTG GAC CCA TTT TAA ATG TAG 



|:082 ^^^2 

$AA TGC TTA TTT TAA AGG TAG TAA AAA ATA TGT GAA TGT TTA CCT CGT GCG CGC CAG GGC 



I' 



■:ii42 

;tTC 
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HC2A 
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HC2A 
KIAA 
rat 
HC4 

;:iic5 

I. = 

;:hc2a 
fikiAA 

-r-^at 

CrHC4 

. HCl 

i.iHC3 
llHCS 

Ei^IAA 
C3rat 
HC4 
HCl 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



FPJW^SSyiPTKQFETCSKTPITraVEEFVPCIPKHTQPyTIYTKHLYVyPKYL^ 



. VLHHHQNPEFYDEIK 

KS FAKARNIAI C lEFKDSDEEDS QPLKC 1 YGRPGGPVFTRSAfAAVLHHHQNPErYDE I K 



lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 
IEIJ^TQLHEKHHI^TFraVSCDNSSKGSTKKRDVVETQVGySWIJ>IJJ^^ 



PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHIjVSTVYTQDQHLHNFFQYC 
PVSANIJ>SGyLGYQELGMGRHYGPEIKWVI)GGKPlJiKISTHI.VSTVYTQW 



QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPT I LNQLrFRVLT-RATQEEVAVNVTRV 
QKTESGAQALGNELVKYXiKSIiHAMEGHVMIAFIJ>TILNQLFRVLT-KA 



MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

: MS FLP 1 1 LNQL FKVLV- QNEEDE I T TTVTRV 

NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDR5NSWVNTGGPKAAPWGSNPSPSAES 
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HC2J\ 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



IIHWAQC 
IliJWAQCH: 




ESHLRS YVKYAYKAE P YVASE YKTVHEELT 
SSHIJ^YVKYAyKAEPYVAS£yKTVHEEI.Tr 



m 



ILKPSADFLTSN 
IltKPSADKLTS^I 



LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLATMMIALLKQSADFLAIN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDIJUCtJVTGLLK-SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG — SVR E 



Cadherin 
Cleavage 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLl RNQP FPASYHHAAETWNMLMPHITQKFGD 
KLLKYSWFFFDVLIKSMAQHLIENSKVKLI RNQF FPASYHflAVETVV^^ 



SAIiQQAWFFFEIi^VKSMVHHLYFMDKLE 



KLIJCYSWFFFEI lAKSMATYXLEENKIKLTjHGQPiFPKAYHHALHSLFLAIT- IVESQYAE 
HVLKHSWFFFAI IIJCSMAQHLIDTNKIQLt RPQP FPESYQNELDNXiVMVLS^ 



:A5RK£ 



SPFPERFMDDIAALVSTIASDIVSRFQK 



HC2A 
KIAA 
rat 
HC4 

HCl 
HC3 
HC5 

D 

HftiC2A 

''4CIAA 

IJrat 

KUCA 

fliHCl 

^^gHC3 

f ?%HC5 



tUHC2A 
i^-KIAA 
idrat 
pHC4 
gHCl 
"HC3 
HC5 



NPEASKblANHSLAVFIKRCFTMDRGFVFKQIlI NYIS — CFAPGDPKTLFEYKFEFL 

NPEASKKANBSIxAVFlKRCFTJMDRGFVTKQIN NYIS — CFAPGDPKTLFEYKFEFL 

IPKESRNVNYSIASFIiKCCIiTIMDRGFVTNLIN DYIS — GFSPKDPKVLAEYKFEFL 

ALEE TRRATHS VARFLKRCFTFHDRGCVFKMVN NYIS — MFSS GDLKTLCQYKFDFL 

DTEMVERLNTSIJ^FLNDLI.SVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRI.DFL 



RWCNHEHYI PLNLPM- 
RWCNHEHYIPLNXlPM- 



-PFGKGRIQR' 
•PFGKGRIQR- 



YQDLQL DYSLTDEF 

YQDLQL DYSlaT-DILF 



QT I CNHEHY I PLNLPM AFAKPKLQR- — — VQDSNL E YS LSDE Y 

QEVCQHEHFI PLCLP I RSANI PDPLTPSES TQELHAS tMPE YS VTNE F 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELS — VPF 
—MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 



Cadherin 
EC motif 



CRNHFLVGL LLRE VGTALQEFRE — 
CRNHFLVGL LLRE VGTALQEFRE — 



— VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 
— VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 



CKHHFLVGL LLRE TSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

CRKHFLIGI LLRE VGFALQEDQD VRHXJOAVLKNLMAKHSFDDRYREFRKQAQIAS 

RQQHYLAGL 7LTE LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
SSTS-SPGL LFTE LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



LYLPLFGLL I ENVQRINVRDVS PFPVNAG-MTVKDESLALPAVNPLVTPQKGS TLDNSLH 
LyiJ>LFGIiLI£NVQRIKVRDVSPFPVNAG-MTVKDESL^^ TLDNSLH 

LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

LYMPLYGMLLDKMPRIYLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

LYLPLIGIIMETVPQLYDFTETHNQRGRPXCIATDDYESE SG SMIS 

LYLPLVG 1 1 LDALPQLCDFtVADTRRYR TSGS DEEQE GA GAI T 
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KIAA 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 

^"fec2A 
^KXAA 
i.^rat 

!:1hc4 

«^HC1 
^^HC3 
^;hHC5 

j,s5=HC2A 
pyKIAA 

L,HC4 
S HCl 
yHC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



KDLLGAISGIA^YTTSTPNINSVRNADSRGSLISTDSGNSLPERK^^NSLDKHQQSS 
}^lAjGPaSGl^^LT7ST?m}^SVBN?d)SRGSLlSTDSG^ 



TDKDTAYGSFQNG HG I KREDSRGSL I P-EGATGFPDQGNTGEN TRQS 

KDVLNS lAAFS S lAIS TVNHADS RAS LAS LDS NPS TNEKS S EKTDNC EK I PRP L 

QTVAMAIAGTSVPQ • LTRPGSFLLTSTSGRQHT 

QNVALAIAGNNFN LKTSG- IVLSSLPYKQYN 



TLGNSWRCDKLDQSE I KSLLMC FLY I LKSMSDDALFT YWN-KAS TSELMDFFTI SEVCL 
TIK^JS VVRCBKIJXSSEIKSIJ^CFXYIIJCSMSDDALrT YTO 

STRSSVSQYIJRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 
AL I GS TLRFDRLIX2AETRSLLMCFLH IMKT I S YE TLIAYWQ-RAPS PEVSDFFS I LDVCL 

T FSAESSRS LL I CLL WVLKN-ADE TVLQKW FTDLS VLQLNRLLDLL YLCV 

MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 



HQFQYMGKRYIARNQEGI/G — PIVHDRKS — QTLPVSRNRTGMM 

HQFQ YMGKR YIAR tgmm 



FHFRYMGKRNIARVHDAWLSKHFGI DRKS QTMP/ULRNRSGVM 
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2 32 

CGA GTA AAT CGT TCT CGA AGC CTT AGT AAT AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 



62 92 

TCA CCA GAT GAT GAA GTT CGA .TCA ATC ATC GGG AGT AAG GGT TTA GAT CGC TCC AAT TCC 
122 . 152 

TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA 
182 212 

G7^ TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 

met asp arg ser cys asn arg met ser ser his thr glu thr ser 

242 272 

AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG 
ser phe leu gin thr leu thr gly arg leu pro thr lys lys leu phe his glu glu leu 

302 332 

,GpT TTG CAG TGG GTT GTT TGG AGT GGC AGC GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG 
^a leu gin trp val val cys ser gly ser val. arg glu ser ala leu gin gin ala trp 

ref 1,1 and 1-2 | 
^62 ref 2.1 and 2,2 ^ 392 

t?|rC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG CAC CAT TTA TAG TTT AAT GAT AAA CTT 
'^e phe phe glu leu met val lys ser met val his his leu tyr phe asn asp lys leu 

^J21 452 

J^BaG- GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT TTC ATG GAT GAC ATT GCA GCT CTT GTC 
"glu ala pro arg lys ser arg phe pro glu arg phe met asp asp ile ala ala leu val 

K82 512 

fiiGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA 
«:ser thr ile ala ser asp ile val ser arg phe gin lys asp thr glu met val glu arg 

f442 572 

pirTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT CTG TTG TCT GTT ATG GAC AGA GGA TTT 
'^ieu asn thr ser leu ala phe phe leu asn asp leu leu ser val met asp arg gly phe 

602 632 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG GTG TCT TCA AAG CTT TAG TCA TTA CCG 
val phe ser leu ile lys ser cys tyr lys gin val ser ser lys leu tyr ser leu pro 

J ref 3,1 and 3,2 
692 
FG TCC TTG AGG CTG GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG 
asn pro ser val leu val ser leu arg leu asp phe leu arg ile ile cys ser his glu 



722 752 

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA CTT ACT CCA CCT GCA TCT CCA. TCA CCT 
his tyr val thr leu asn leu pro cys ser leu leu thr pro pro ala ser pro ser pro 

J ref 4.1 and 4,2 
812 

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA TTT TCT ACG AAT GTA CAA GAC CAA AAG 
ser val ser ser ala thr ser gin ser ser gly phe ser thr asn val gin asp gin lys 
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842 872 

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT 
ile ala asn met phe glu leu ser val pro phe arg gin gin his tyr leu ala gly leu 

902 932 

GTG TTA ACA GAG CTG GCT GTC ATT TTA gac cct gat get gaa gga ctg TTT GGA TTG CAT 
val leu thr glu leu ala val ile leu asp pro asp ala glu gly leu phe gly leu his 

962 992 

AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC TCC AGT CAC GAC TCA GAC CCG CGG TAC 
iys lys val ile asn met val his asn . leu leu ser ser his asp ser asp pro arg tyr 

1022 1052 

TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG TTG TAT CTA CCT CTG ATT GGT ATT ATC 
ser asp pro gin ile lys ala arg val ala met leu tyr leu pro leu ile gly ile ile 

1082 1112 

ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA GAA ACT CAC AAT CAA CGA GGA AGA CCA 
met glu thr val pro gin leu tyr asp phe thr glu thr his asn gin arg gly arg pro 

3ffl42 1172 

AiT TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG AGC GGA AGT ATG ATA AGC CAG ACC GTT 
i-Je cys ile ala thr asp asp tyr glu ser glu ser gly ser met ile ser gin thr val 

lM02 1232 

GgC ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC 

ala met ala ile ala gly thr ser val pro gin leu thr arg pro gly ser phe leu leu 
«H |ref5,l and 5.2 

p62 I 1292 

^CG TCA ACG^AGT GGC AGG CAA CAC ACT ACC TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG 

tJtir ser thr ser gly arg gin his thr thr phe ser ala glu ser ser arg ser leu leu 



pf322 1352 

hie TGT CTA CTT TGG GTT CTC AAA AAT GCA GAT GAA ACA GTT CXA CAG AAG TGG TTT ACA 
'Me cys leu leu trp val leu lys asn ala asp glu thr val leu gin lys trp phe thr 

111382 1412 

GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC 
asp leu ser val leu gin leu asn arg leu leu asp leu leu tyr leu cys val ser cys 

1442 1472 

TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA ATG AAT AGC TTG ACC TTT AAG AAA TCA 
phe glu tyr lys gly lys lys val phe glu arg met asn ser leu thr phe lys lys ser 

1502 . 1532 

AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT CTT GGG AGC ATA GGT GCC AGG CAA GAA 
lys asp met arg ala lys leu glu glu ala ile leu gly ser ile gly ala arg gin glu 

Iref 6.1 and 6.2 
1562 4 1592 

ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA AGC CCA TCT GGA AGT GCC TTT GGA AGT 
met val arg arg ser arg gly gin leu glu arg ser pro ser gly ser ala phe gly ser 



1622 1652 

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT 
gin glu asn leu arg trp arg lys asp met thr his trp arg gin asn thr glu lys leu 
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1682 1712 

GAC AAA TCA AGA GCA GAG* ATT GAA CAC GAA GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA 
asp iys ser arg ala glu ile glu his glu ala leu ile asp gly asn leu ala thr glu 

1742 1772 

GCA AAC OTA ATC ATT TTA GAT ACA TTA GAG ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA 
ala asn leu ile ile leu asp thr leu glu ile val val gin thr val ser val thr glu 

1802. 1832 

TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC 

ser lys glu ser ile leu gly gly val leu lys val leu leu his ser met ala cys asn 

ref 7.1 and 7,2 

1862 1892 

CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT ACA CAG AGA GCC TTG GTT* TCA AAG TlTT 

gin ser ala val tyr leu gin his cys phe ala thr gin arg ala leu val ser lys phe 



1922 1952 

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC 
pro glu leu leu phe glu glu glu thr glu gin cys ala asp leu cys leu arg leu leu 

1^.82 2012 

CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA 
his cys ser ser ser ile gly thr ile arg ser his pro ser ala ser leu tyr leu 



^042 2072 

e3iA ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA 
igu met arg gin asn phe glu ile gly asn asn phe ala arg val lys met gin val pro 

S02 2132 

Ai G TCA CTA . TCC TCC TTG GTG GGC ACA TCT CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT 
liet ser leu ser ser leu val gly thr ser gin asn phe asn glu glu phe leu arg arg 

Mi62 2192 

PET CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT 
ser leu lys thr ile leu thr tyr ala glu glu asp leu glu leu arg glu thr thr phe 

;^22 2252 

■CCT GAT CAG GTC CAG SAT CTG GTT TTC AAT CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA 
pro asp gin val gin asp leu val phe asn leu his met ile leu ser asp thr val lys 

2282 2312 

ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG ATT GAT CTA ATG tac aga att gcc aag 
met lys glu his gin glu asp pro glu met leu ile asp leu met tyr arg ile ala lys 

2342 2372 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG ACC TGG TTG CAG AAC ATG GCA GGC A7U3 CAC 
gly tyr gin thr ser pro asp leu arg leu thr trp leu gin asn met ala gly lys his 

2402 2432 

TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT 
ser glu arg ser asn his ala glu ala ala gin cys leu val his ser ala ala leu val 

2462 2492 

GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT- CTT CCT GTG GGA TGT GTA ACA TTT 
ala glu tyr leu ser met leu glu asp arg lys tyr leu pro val gly cys val thr phe 
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ref 8-1 and 8.2 

2552 

CAG AAT ATT TCA TCT AAT GTT TTA G?VA GAA TCT GCG GTC TCA GAT GAT GTG GTA TCT CCA 
gin asn ile ser ser asn val leu glu glu ser ala val ser asp asp val -val ser pro 



2582 2612 

GAT GAA GAA GGT ATC TGC TCT GGA AAA TAG TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG 
asp glu glu gly ile cys ser gly lys tyr phe thr glu ser gly leu val gly leu leu 

2642 2672 

GAA. CAA GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG TAT GAA GCA GTT AAT GAA GTT . XAC 
glu gin ala ala ala ser phe ser met ala gly met tyr glu ala val asn glu val tyr 



u 

i 



2702 2732 

AAA GTA CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT GCA AAG AAA CTA TCC ACA ATT CAT 
lys val leu ile pro ile his glu ala asn arg asp ala Ivs lys leu ser thr ile his 

Iref 9.1 

2762 2792 ♦ 

GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT CAG AGT ACT GGC TGG GAG CGG ATG 
gly lys leu gin glu ala phe ser lys ile val his gin ser thr gly trp glu arg met 

k§22 2852 

iff T GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAA 
piiB gly thr tyr phe arg val gly phe, tyr gly thr lys phe gly asp leu asp glu gin 
L-U ' , 

2.882 2912 

TTT GTT TAC AAG GAG CCT GCA ATA ACC AAA CTT GCA GAG ATA TCT CAC AGA TTG GAG 
jgLu phe val tyr lys glu pro ala ile thr lys leu ala glu ile ser his arg leu glu 
ref 10.1 and 10.2 

2945 2972 
*-GGA TTT TAC GGA GAA AGA TTT GGA GAG GAT GTG GTT GAA GTA ATC AAA GAC TCT AAT CCT 
^'gly phe tyr gly glu arg phe gly glu asp val val glu val ile lys asp ser asn pro 

f y 

|=a002 3032 . 

|.3gTA GAC AAG TGT J\AA TTA GAT CCT AAC AAG GCA TAT ATT CAG ATT ACC TAT GTG GAG CCA 
f^^al asp lys cys lys leu asp pro asn lys ala tyr ile gin ile thr tyr val glu pro 

:4o62 3092 
XAC TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC TAT TTC GAC AAA AAT TAC AAT CTT 
tyr phe asp thr tyr glu met lys asp arg ile thr tyr phe asp lys asn tyr asn leu 

3122 3152 

CGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TXA GAT GGC CGT GCC CAT GGG GAA CTT CAT 
arg arg phe met tyr cys thr pro phe thr leu asp gly arg ala his gly glu leu his 

3182 3212 

GAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT CAT GCC TTT CCT TAT ATT AAA ACA 
glu gin phe lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr 

I ref 11,1 

3242 £ 3272 

AGG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA ACA CGA ATT GAA GTT GCT ATT GAG 
arg val asn val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu 
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3302 3332 

GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA ACA CAT CAG GAT CCC GCA GAC CCC 
asp met gin lys lys thr gin glu leu ala phe ala thr his gin asp pro ala asp pro 

33 62 3392 

AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC ACC ACA GTG AAT CAG GGG CCT TTG 
lys met leu gin met val leu gin gly ser val gly thr thr val asn gin gly pro leu 

■ 3422 3452 

GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT GAC CCA AAG CTC TTC AGA CAT CAT 
glu val ala gin val phe leu ser glu ile pro ser asp pro lys leu phe arg his his 

3482 ' 3512 

AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA AGG TGT GAA GAT GCC TTA AGA AAA 
asn lys leu arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys 

3542 3572 

AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT CAA AGG GAA TTG GGG AAA CXA TCT 
asn lys ser leu ile gly pro val gin lys glu tyr gin arg glu leu gly lys leu ser 

3602 3632 

|gG CCT TAA AGA GGC CCT ACA GCC CTA GAT CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG 
5gr pro OCH 

p62 3692 

i4T TGC TTG TCC CCT GCC ACA GAG ATT CCT TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC 



22 3752 
.fCT AAA CTG AAT GCA CTT GTT TTA TTC ATC TGC AAA GAG CCA TGT ATT CAA CAT CGA GTG 



j3782 3812 

I^IGA AAA GAT CTA TTG GAA ACC AAC ATG GAA TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG 
:5842 3872 

;'4aT GCA GTG GCC AAG AAA ATA TCA AAT GTA GAT TGT TAA CGC TTG AGA ATC ATG GCT ATG 
□ 902 3932 

[pTT TCT AAT GTT CGG GTA ACA AGC TGT TAT CTT TTA AGA CAT TTT AAT GAC TCA AAG GTA 
3962 3992 

CAC TAT ACA TTT ACC ATT ATT TAT ACC ATA GCT AAG GTT. AAA AAT TXA TTC ACT TTA AGT 



4022 4052 

TCG TAT TTT TTA ATT TAT ATC ACC ATT TAT AGA. TTC ATT TTG GAC CCA TTT TAA ATG TAG 

Jref 12.1 

TAA TGC TTA TTT TAA AGG TAG TAA AAA ATA TGT GAA TGT TTA CCT CGT GCG CGC CAG GGC 

4142 
CTC 
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Refl.l 

Sequence of BAC8 using primer C3S3, which spans nucleotides 340-359 of the cDNA. Exon 
sequence is underlined and represents nucleotides 364-380. 

m i l l G AATTA ATGGT G AGCAAAAACTG AGCATGTTCTTT AATA 1 1 ' IM 1 ' I CTrTT AnXO 
AACAA 11 ■ I ■ 1 ATGCTAGCTCATTTGTTACCTTAG AAATC 1' 11 ' I " I CTGTTGCACATCTTAAC 
GC 1" r 1 ■ 1 CCATGTGCCTCTAAGACAAAATTAC ATGTGTTAC ATCTCTA AATAA ACACTGT 
GG ACACTCAACAC^GTTTAGGTGGAATTAAG AGTG AGGCTCATTTTA ACrCTTA' I'l ' i ' IC 
TQ\GGGATGGTTGCATAAGCTAGCTATATTTTC::AAAGGAAACnTGTGATACA 14'C 1'1'G 
CrAGTO\TTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGG 
AAAGAGCTTTATAGATACCCACTGCrATTGTTATGGCTAGTAAACCCTTAGGGAAATG 
CCAGTTACAATCAATAAAAAAACAACAGTCTGGGTGGGTGCAGTGGCTCACACCTGTA 
^TCTCAGCACITTAGAAGGCCGAGGCAGGAGGATCACTrGAGATCAGGAGTTTGAGAC 
IllAGCGrGGGCAACATAGCAAGAGCCCATATT^ACCCAAAAAAAA 1 ■ 1 1 1 1 TlTl AAAT 
§iAAGCTAAAACCCTGGNNGGCCACAAAACCTGTAGTTCCCATCTACTTTGGAAAGGCT 
J^AAGGANGGGAGGGCrTGCTTTGAGCCCCAAGAANGTTCAAAGGCTNGCNGNCAC^ 
|fT^^■GATTCNACACNTGCAACTCCCGCATTGGGTNAACAAAANCCAAGGAANC. 

was; 

ji^equence of BAC9 using primer C3S3, which spans nucleotides 340-359 of the cDNA. Exon 
f||equence is underlined and represents nucleotides 371-380. 

1' 4\ATTAATGGTG AGCAAAAACTG AGCATGTTCTTT AATA ri ' i T 11 CTCTTAGTG AACAAT 
IfrrrATGCTAGCTCATrrGTTACCTTAG AAAT C 1 " 1 " 11 T CTGTTGCACATCTTAACG C 1 1 " 1 1 
tecATGTGCCTCTAAGACAAAATTACATGTGTTACATCTCTAAATAAACACTGTGGACA 
"CTCAACACAGTTTAGGTGGAATTAAGAGTGAGGCTCATFITAACrCTTATTTTCrCAGG 
G ATGGTTGC ATAAGCTAGCr AT ATTTTCAAAGG A AACTTGTG ATACA 1 1 CI ' I ' 1 GCTAGT 
CATTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGGAAAGA 
GCirrATAGATACCCACTGCTATTGTTbrrGGCTAGTAAACeCrTANGGAAATGCCAGTT 
NCAATCAATAAAAAAACAACAGTACTGGCTGGGTGCAGTGGCTTACACCTGTAATCTC 
AGCACTTTATAAGGCCCNAGGCNGGAGGATCACTTNAGATCCAGGAGTTTGAGACCAG 
CCTGGGCAACATANCAAG AGCCCATTATCTACCAAAAAAN rn T 1 'l "I'l 1 TAAAATTAAG 
C^AAACNCTGGGTGGNACAAACCTGTTNG^rITCCNATNTNCCTTTGGAAAAGCTTANG 
AAGGGG AGGGCITNCTTTGG ANCCCCA AAAAGTTNAA AGGGNTTGCAGTCAGC C 1 T f i 
NAATCACCCNNNGGNCCTNTCGCATTGGGATTNCCAANANGCCAANGNAACCCCGNT 
COTNTTTTAAAAAAhTThTITTTAAAGNANhmTNTTNGN 
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Ref2.1 

Sequence of BAC8 using primer HC3AS7, which spans nucleotides 386-405 of the cDNA. Exon 
sequence is underlined and represents nucleotides 381-385. 

tcttt nngaaaaagatranattattaattcratgatatattaacaatacacanctctaa 

cacttggactatttttaaaatatggcatgtaatttaAtagatgactgaaatattt^ 

ttctcaaatattttttaaagttccctacaatgttttgtatttgcttaaaataaaatan^ 

aaaaccaccatattacrttcagaaaattatgctagctaacaataggacaaaaaattcr 

gtgtatgtc a ac aaa aaaaattc aacctt aaa i t 11 ' 1 ' i ' i " i' 1 ' 1" 1 ccata aa aaac agggc 

tacttgcccaggtgagangtgctgccgtatgagcrcctcgntagattgcgcngccgga 

hTTGTCGGNCCCTNCGTTTAATATAACGGCGTGNGCNTGTACCGCAGGCTNTGCTAGGT 

CGTG^^cccAAGATATC^^n^^^^^'ANCATANTAGACG^r^GGNGNCG^nv^^<^^ 

TNATT^r^NGCAATTGTNACAATCCTAGT^^^GTAC^^^NANAG^^'CNGCCNCTGTGA^^^ 

CGTTGTATAGTCNGNGGCNCGCTTGhm'CrGATGCrGAGAGCAOTNNCNNACTmTh^ 

NCNCCCATCnTTNCNmTNNNNNCCCCCmTTNNATNhTITT 

NTNAANNNACCNCC 

O 

Irllef 2.2 

£ ,^equence of BAC9 using primer HC3 AS7, which spans nucleotides 386-405 of the cDNA. Exon 
^--sequence is not found within this sequence. Since the primer is directed against exon sequence we 
' ypresume that sequence derived from HC3AS7 is intron sequence; Additionally, this sequnce 
l-pmatches the intron sequence found in the previous sequence (BAC8 sequenced with HC3AS7). 
^GCGCTNCCN^n^^m^^TTTATCTTCTGAAAAGACTNATATNATTC^ATGATTATAAC^ 

MCACACTCTAACACTGGACTtNTTAAATATGGATGTAATTAATAGATGACTGAATATTTT 
I^UGCTTCTCAAATATmTTAANGTCCCTACAATGTTTGNAThrrGCIT 

{ y AAACCCCATATTACTTTCAG AAAANTATGCTAGCTAACAATAGGACANAAAAATTCTG 
O TGTATGCAACAAA AAA AAATTC AACCTTN AATC 1 ' 1 C 1" 1 ' i " 1 ' 1 ' 1 " 11 CCAATAN AA AACAGG 
OGCTACTCTGCCACAGGCrGGAGTCAGTGGCrGATAO\GCTACTGCAGCTCACrCCGG^ 
CTATGTGATTGCCTGCCTAAGCCTCNGAGTAGTAGGCTCAGGTGCCACrACATGCCAG 
TAATCTAAAATTTATAGAGACAGGGCTGCTGTGTGNCCAGGCTGGCTAACTCCGGGCT 
AAGCGTTCTTGCCTNGCTCTAAATGTGGGATACAGNATGTATCATNCATCAGCCAAAA 
AGTTAATTAA^m'CCAGATNAm'ATT^GCATCAAAGCTCCAATNTAGCTTGAAGTAGA 
ACCTGCTCKITGGCTAGAhrrATCCCGNKrGTTATGGATCATATTANGCNNTTGTGATGC 
CGAATGGNATCTATTCCGGGAGACANATTACTATNGGATGANAGCANATNGCCCNNAT 
GCn'mTTGTAACGCTT^ANmAAGAAO^CrmGACATCGTCATAGmCGfi^GTNm' 
^^^lGCGANTTGATAC^AANT^CATG^^'ANGCCNATGACT^^•NGTGATT^rNTGANTGNCT 
GGGAGAACCTACNri>lCCCCm"ACNNATAhWCTNCACCCCCrACTACrNTNNCNNTCNC 
TCTCTA^^TCTACTCCAC^^ITATTATCCTCNNCCTTCNCATC^^^CCCATC^™A^^ 

GCCNCNANACTTANCNTTNATNCACTCTNNCT 
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Ref 3.1 

Sequence of BAC8 using primer C3AS4, which spans nucleotides 737-756 of the cDNA. Exon 
sequence is underlined and represents nucleotides 677-733. 

TAATGTACATAGTGCrCATGACTGCAGATGATTCGTAGAAAATCCAGCCrCAAGGACA 
CCAGAACACTGGGATTCGGTAATGAGTAAAGCTTTGAAGACACCTTGTAAGCAATGCA 
T AAGTA AG AG AA CACCAA1TG AATCTATTA I' i ' 1 Ci ' I' 1 AATACTAATACCAGAATGGC A 
AATTAGAATTAAj\GAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCTTAAGTAGCA 
GTATCACCTTTtTCCAGAGTTACTGCTAAAATTAAAAATTTTAAACTATCAGGTTTA 
GTATAAACATATTTGACTAACCTAAAAGCCACATTCTTGTATTTCCAATATAGCATCAA 
TATTTCTACTTCTCATAAAACAGGGAAAACGTATATCACCAAAAATAACTTC^ 
TTCCnrCTTAAAAGAAATTATCAATTCITTTTATAGCACnTTGTG 
AATTTGTCTGTITTCrCAGCAACATCATAAGCrACrTGAGGAGACA 
TTTAACAGCTTTAGTGTCCCrACAGCTTAGCTCAATGTTTGACAAATATAC^ 
TGCTTAAAGGAATAAAGGCCAGGACAAGTTCTGGTAGCAAATAGTGCATAAAAGGTTT 
TGGGGiGAAAAGGGTAAAAATGGATACATATCGGGGTNGCAAGmTTTTCCATGTGGG 
JGTGAGGTGCCCCATGCCTT 

-.ySequence of BAC9 using primer C3AS4, which spans nucleotides 737-756 of the cDN A. Exon 
,^isequence is underlined and represents nucleotides 677-730. 

jj TAACATAGTGCTCTGACTGCAGATGATTCGTAGAAAATCCAGCCrCAAGGACAC CAGA 
hACACTGGGATTCGGTAATGAGTAAAGCTTTGAAGACACCTTGTAAGCAATGCATAAGT 
AAGAGAACACCAATTGAATCTA1TA 1 r 1 C i " 1 " 1 AATACTAATACCAG AATGGCAAATTA 
nGAATTAAAGAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCTTAAGTAGCAGTATC 
I ACC 11 r n CCAGAGTTACTGCTAAAATTAA AAATTTTAAACTATC AGGTTTACTGTAT A 
y AACATATTTGACrAACCTAAAAGCCACATTCTTGTATTTCCAATATAGCATC^ 
= CTACTTCTCATAAAACAGGGAAAACGTATOTCACCAAAAATAACTTCITATTAC^ 
TCTTAAAAAGAAATTATCAATTCITTTTATAGCACnriTGTGCTTACCCrGNATT^ 
TTGNCTG^^^TTTCTCAGCAAACATCATAAGCTACTTGAGGGAGACATAC^ATTAAACCT 
GATTACAGCTTTTANGTGTCCCTACAGCITAACTCAATGTTTTGCAAAATOThn^ 
TCAATGGCTTTAAAGAATAAAAGANCAGGGACAAGTTNTGGGTNGCCATNAGNACAA 
TAAAGGTTTTNGGGGGAAAAGGGAAAAAATNGATTNCATWCGNGGTTNGCAAGGTN 
TTTTCCATTGNGGGGNGGAGGGGCCCATGCCATAA^^ITITAACCll'lCI■i■l■i"l■lN 
AAATTAAACNNTTAAAGGGGTN 

Ref 4.1 

Sequence of BAC8 using primer HC3AS6, which spans nucleotides 924-944 of the cDNA. Exon 
sequence is underlined and represents nucleotides 802-917. 
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CCAGTCrC.rAATATGCTGTGCGAAGCCGATATCAACTTTGCATCTTTGTCTTGNCATTC 

GAGAAATCAGACTTGTGGAAGTAGGAGACAGCTTAC AGCGTGCACAAGCTCTCAGCA 

GAGCATATACGAATGAATCTTTTCCAGGGAGTTATTTATATACTACCTGAGCAAGCCA 

CITTAGCTTTGGGCAGGAACITNrrGGATGTTATAAGTAATACrTATATGAATAATATGA 

AATT AATATTTACTTC 1 1 ' 1 ' I ACA NTCTTCTC i " 1 " I ' I CCTTATCTTAGCCTTTATCCCCTTGT 

GGAAAAGACACTATCAATGCTAGATNCTCCCAAGNCAGAGAATTATGCAGGTTTGGTC 

AGAGAATCGACACAGACATGTTTACAGATTCTTCTTGAAATACATATTGTGCACGAGT 

TTTTTACA^^^ATC^CAATTTAGATCTC^GACAGCAT^r™GACTAGNGGGTCTAGG 

AGATACATKTTTGNGAACrTCTATAGAANAAChrrm"GCKITAAAAAGGAGC^ 

ANANGAATN^WC^GNGAAGGGCCCGATACGANAATTTGACITC.GGNGAAAATTNNNG 

GATTNNTACAAANTITCTAGGNGGCACCTTNAAAANGNOTGGGNACNTT 

AAAAAAGCCCTTC^^IT^AG^^^^^'CCCNGAAATGGAAAAGTNCCAA^TITC^ 

ANGGGGTTTGTTNNCTTNCNANA 

* ■ * 

Ref4.2 

CSequence of BAC9 using primer HC3AS6, which spans nucleotides 924-944 of the cDNA. Exon 
"Hsequence is underlined and represents nucleotides 802-921. 

j lGACGCCAGCnTTGTACACAGTCTGCAATATGCTGTGCGGAAGGCCGATATCAACTATT 
Ci GCATCTTTGTCTNGNCATCGAGAAATCAGACTCTGTGGAAGNAGGCAGACAAGACTAT 
yACAGCOTGCACANAGCATCTCAGCAGGCATATAAGAAtGAANCITTTCCAGGGAGTTA 
iTTTATATACTACCrGAGCAAGNACTTCAACrTNGGCAGGAACTTGTGGATGNTTATAA 

GTATACTTATATGAATAANATNGAAATTAATATTTAATTCTTTTACTTCTrCTC^^ 
* TTATCITAGCCTTTATCCCCrCGTGAAAAAGAGCACTAATCAATGCTATTNCrNCCAAG 
NCAGGAATTTATAGCAGGTTGGTCGAGAATCGACACGACATGTTTACAGA^^•CATCTT 
J GAATACATNATTGTGCACGAGTmTTTACTCTATCrCAAAATATAGATCrCAGATGGTC 
3 TATNGANTATGNGGTTCTAGGACATGATTACATTTTTNGGGAACTTCCATAGAATAAA 
3 C^f^m'ACCTNAAAANANGAGCCTGTTNGAAATNGAATCrACTNCTAAAGGGCNAGTNC 
CANATTTTACITCCGCGANATNTCNGGATGTTACAAGTCTAGGGGGNCTTTAGNACGT 
TNGATmTTGANCGGAAAAAAGCCCTTCTANNGGTCNCCTAATGGAAGCGCCAATTCC 
NAANAAGGNCTGTGTT^m'NGAG\TTTACCNGNNCC^^TTCTAATCAAJ\C^™CTC^ 
TNNNANCCNCANCNCNNNCCtATAhWCCrATCNCTCNNCTNNNCrChn'C^ 
NCT^^■(^TCC^^TCTNCAC^^m^NN^^"CNCTNNNATNNNCTTCTCC^ 

^m^^'CA^^^CTCCCTACN^^'NCNcm•^m"ACCATCTNCNccNNCCT 

Ref5.1 

Sequence of BAC8 using primer C3S6, which spans nucleotides 1 127-1146 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1181-1269. 

GTATGATrCGCCAGACCGCTGCCATGGCAATNGTAGGGACATCGGTCCCTCAACTAAC 
AAGGCC^GGCAGTTTCCrNCTCACGTCAACGG TCAAAACAATCCTTCTACAGAATTTTT 
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TTTTCTNGAAAGACAAATATTTACTAGGATATGCCCTTAAATATATGAGATGATTGTAT 

CAGCrGATGCAAAAGTGCTCAGTTTTATTTATGAAAATATTAAAGTTCCCAGAATATTA 

ACTGTCTTCTCCCAAACAGTTrrAAAAAATGATrACCTCAAGGTTTATGGGAAAAAGC 

CCCGTATTCrGCATTCAGAATTTGGAAAATTGCCTCATTATAGATAGCCATNTCTTTTTT 

TTN l-l-ri riM I ATNCTTCAAGTCITAGGGNACATGTGCACAACATGCAGGNTAGTrACA 

TATGTATACATGTGCCATGTTGGTGTGCTGCACCCANNAACCCGCAATTTAACATTAGG 

TNTATCTCCAAATGCTATCCNTTCACCCTTCCCCCATNCCACAACAAGGCCCCGGGCNT 

TGNGATGTTCCCCITCCrGTGCCCACTGTGTNTCACATTNCCNCTTCCCNCCCTTANTN 
NNGTGCAGAACNTNGCCNGTNGCCCThrrNTTTTTNNCCC 

Ref5.2 

Sequence of BAC9 using primer C3S6, which spans nucleotides 1 127-1 146 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1 151-1269. 

CCNCTGATGATTATGAAAGTnAGAGCGGAAnTATGATAAGCCAGACCGTrGrrATr.r.r 
l aATCGCAGGGACATCGGTCCCTCAACrAACAAGGCCrGGGAGTTTCCTrCTCArGTrA 
jlCQGTAAAAACAATCCTCCTACAG AA 11 r 11 ' I Tl TCTAGAAAG ACAAATATTTACT AGG 

.^TATGCCCTTAAATATATGAGATGATTGTATCAGCITGATGCAAAAAGTGCTCAGGT^ 
■iTTTATGAAAATATrTAAAGTTCCAGAATATTTAACTGTCITCTCCC^ 

ryiu\TGATACCTCAGGTTTATGGGGAAAAAAGCCCCGtATrCTGTQ\TTCAGAAATTTGG 
.':3\A AATTTGNCTCATTATA GATAGTTCA l i ilCl i 11 1 1 1 li 1 1 1 111 irrr ATACNTTTAA 

AG r 1 1 I I AAGGGGNACCATGTTGCACCAAANATTGCAGGGGTTNGGTTACCATTATGG 
J-TTATTNCCATTGGTNCCCCCAm-GTTTGGGGNGTTGGCTTTGC^ 
f^ACCNNCCG^^■GCGAATTTT^AAAACAA^^^TTTGGGGGTTATTA^^ 

j^iNGGChnrriTNCCCTTTNCCCCCCTTTCNCCCNCCCNNTTCCCAACNNANCAAGG 

peCCCGGGTANTGGGGGAATAGNTTTTCCCCCCCTTNCCCnTvlGNNGGGCCAATrGTGG^ 

^^NNNCTCCATTTGG^f^^'GCAAA^m■CCCCCACCC^m^lATTGT^GGTGGNGAAACCA 

CCGGGGGGTTTGGGGG 1 " l l' 1 i i ri GGTCCCCmTGCCCAAOTAATTTTTGCNTTGAANA 

AAAAGAATGGGGTTTTCCAAAGCTTTTNGTCNCCATTTGm'CCCITTANG^ 

GTTCNCCTTNCCANAANGGGCCAATGTGAAACNNCCTTTCA 1 111 1 rrriA ITGGGGNT 
TNCCNTTATGGN 

R f 6.1 

Sequence of BA€8 using primer C3S7. which spans nucleotides 1513-1532 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1535-1588. 

TTCTTGGGAGCATAGGTGCCAGGT AAGAAATGGTACGGCGAAGCCGAGnACAGrTrn 
GTACGTACACAATAGCTTCrCCTCCTGGTGAGAATTTCITCAATTTC 
ATTGTAATGATCATTGTTGCTAGTCTrCAATGTCAATCCTATGCTTTTTAAA^^ 
TAAGTGTAACrGTGAATTAACTTGAATAATCATTTCrCTGCAGTAATAAAAGTTAGAAT 

i 

> 
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TCrGATTTAGGTGAGTCAGCATACCGCCCCCCCCCCCGTTTTCTCTAGAAAGTCrrCTC 

TAGAAAACGTTCrCTAGAAAGTCCtATCTAGAAAACriTCTCTAGAAAGTCCrTATGTG 

ATTAATAGCATCCATCGTCCCTTTTTAAATAGACTrTAll'lllGTAGAGCAGTTTTAAGT 

TCACAGCAAAAGTGAGCAAAGGTACAGAGATTTCCCATATACCCCTTAGTATGCGTAG 

CCTCCCCCATTATTAACATCCCCCATCAAGAGTAGTGGATTTGTTGTAACTGGTGAACC 

TACATTAACACAtCATCACCCAGAGTCCGCAGTTTACATTAGGGATCATTCATATAACA 

TCTATTTTTAC I" 1 ' 1 ' 1" 1 T i ' I ' J r 11 AGTTGA G ACAAG ATTCTCGCTCTGTCACCCAAGCTGG 

AGTGCAGTCCGNGTGGATTGTNGGCrTAGTGNCN 

Ref6.2 

Sequence of BAC9 using primer C3S7, which spans .oucieotides 1513-1532 of the cDNA. Exon 
sequence is underiined and represents nucleotides 1547-1588. 

GTGCCGGCAAGAAATGGTACGGCGAAGCCGAGGACAGCTCGG TACGTACACAATAn^ 

TTCTCCTCCTGGTGAGAATTTCTTCAATTTCCTTGAGTTGTATATTGTAATGATCATTGT 
TGCTAGTCTTCAATGTCAATCCTATGCTTTTTAAAAAGTGTTTTAAGTGTAACTGNGAA 

ejtaacttgaataatcatttctctgcagtaataaaagttagaattctgatttaggtgagt 
^eagcataccgcccccccccccgttttctctagaaagtcttctctagaaaacgttcrcta 
j^^'jsaaagtcctctctagaaaactttctcragaaagtccttatgtgattaatagcatccatc 
sjrrccctttttaaataagactttatttttgtagagcagtttta^ 

rikgcaaagggcagagatttcccatataccccttagtatgcgtagccrcecccattattaa 

;yj\tccccatcagagtaagngcatttgttgtaactggngaacctacaatrnacacatt 

^^''tnaccccacagtcccgcaggtttacatttatgggatcattcccctantaacacctattt 

|=iTTTTACrrn rri ri l l l ri AGGTTGAGACAAGAArri-I CGGCTTCITGTGTACCCCCAA 
fuACTNGGTAGTAGNCNNACCGTCGNGNAATTTm'GGGGTTTCOTNGNGNNCAhrrTTGTG 
tl CNNTCNNCTTNNCNCCNAAAGAANl "11 Trri ■ICACCCr 1' 1 11111 CCCCCNAANANANN 
p ANCCTTCCCCTTGGNGGANGCTGGGGACTTCCNCAGNGGNGG 

RefT.l 

Sequence of BAGS using primer C3S8, which spans nucleotides 1842-1860 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1861-1917. 

CAAAGTGGAGTTTATCTCAACACTGTTTTGCrCACAGAGAGCCTTGGTrTCAAAGG TAG 
GTTATTTTGTACCTGCAGTGTTGTCAG AC 1 ' 1 IG 11 1 ' I ' 1 '1 ' 1 TATTAACATTGTCTAAGATC 
ATTTGACACATTCATTGGTTAATATATGtAGTAATATATTAATGAATATGTGTAGTTAA 
AATTTAAATAATAACCTAAGACCCITAATTCTTCTTTGCCTCTCTACrGCTGCCTGCC^ 
TTAGAATTTTTCATTTATTCGAATCACCTTTAACCAGTTCTGGTTTGAAAAA^ 

TTGGATGTGGAGAAGGGCCTGAAATTAATAGCCAATCTTAAATATGGGGCITCTCTTG 
TTTTCTCTTCACTTGGTTCTGTTTTTATAAAAAACTCAATT^ 

AAGCAATTCAACCCACTGAAATTATTTTATGATGAATGGAAAAGAAGGTATGTGTTTG 
TTCAACTGCmAAATGTTTACnTCTTATATTTGTTTTCCCI^ 
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AAATTrrGAAGGTAGCTATTTCATTTTAATCATCCTAGAGGATGGAATGCANAGATGTT 

GGAtGAAAATAACTTACGTATTATTTTGTAATAAATATAAGAATTCATATATGGTTGAT 

TACCTAAGTGGTTTTATGCACATrCTGATAGAAAGCTTCACCAACAATCCCTTGNTNGA 
TATATTATTT 

Ref 7.2 

Sequence of BAC9 using primer C3S8, which spans nucleotides 1842-1860 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1864-1917. 

AAGTCAGTTTATCTACAAGACTGTTTTGCrACACAGAGAGCCTTGGTTTCA^ 
TTATTTTGTACCTGCAGTGTTGTCAG ACTTTG 1 1 U ' 1 T 11 ATTAACATTGTCTAAG ATCA 
TTTGACACATTCATTGGTTAATATATGTAGTAATATATTAATGAATATGTGTAGTTAAA 
ATTTAAATAATAACCTAAGACCCTTAATT C 1 i Ull 1 GCCTCrCTACrGCrGCCTGCCTTT 
TAG AA 1 1 111 CATTTATTCG AATCACCTTTAA CC AGTTCTGGTTTG AAAAACAGTAACT 

TGGATGTGGAGAAGGGCCTGAAATTAATAGCCAATCTTAAATATGGGGCITCTCTTGN 
TTTCTCTrCACnrGGTTCrGTTTTTATAAAAAACTCAATTTATA^ 

C|GCCATTCACCCACTGAAATTATTTTATGATGAATGGAAAAGAAAGGTATGTGTTTGTT 

jfACCTGCTTTAAAATGNGACKrChrrAATATTTTGGmTTCCCCT^^ 

|.ju\TCCTTAAAAmTTOGAAAGGGANGCTANriTTCATT^^ 

IjATTGGGAAAtGCNCAAGATTTTTTGATTGAAAAANAACCITANCGNAT^ 

lpGNAATAAAATAATTAGNAATTCC^^'ATTATG^^ITNGAATTACCTAAAGTGGTTT^ 

;:|rGCCCCATTTCmTGATATGNAAAGCCITTCACCAACCAAATTCCCCNTTGNNAGGAAT 

XtTA rr l 'ri I NANGGGCCrCOTNTTTGTGGGGNTGGAAGNAAAAACCrTTGTTCCA AAG 

i=5GGTCCCCNC 

£ n * ■ 

>t w 

■* *nr * 
» • * 

jr;Ref8.1 

□ Sequence of BAC8 using primer C3S 10, which spans nucleotides 2412-2^ I of the cDNA. Exon 
= J sequence is underlined and represents nucleotides 2437,-2523- 

AGTGCrAGTCCrCAGCAGCACrTnTT GCrGAATATTTGAGCATGTTGG AGO ACrnr. A A 
ATATCtTCCrGTGGGATGTGTAACATTTCAGG TAGGAATrTTrrAOATr.TAnA'rTrA a at 

CAAGGTATATCri4U-iriGGTTTTAGCITTTCTCACTGGTGTTTAGA 141i4 14 AGT^ 
AAGGAAAGCTTAAAGACITAAGCCAATGCnTCACAAGGTGAATTAACATTTCACAGTG 
ATTGTCATTAATACATTTITAAGGAGTACTTCTTGTTGATTCTCTTTCCACAGTT^ 
CCTCTGAATTATCAGCACTATGCTTATTTATTCrCITTGGCrTTACTG 

GTTACATACTTTAACATCTATGGAAATGTATTACrGATAATCAGAATTCAGTAGAAATT 

CnTAATTGGCTTTrACTTCACATAGCAGATATACCAACATTCTCTATTCCCTACATAAA 

ATATTAAGATTATTTTATGACTAATACCCATGACrCACAGATGAGTTrGCCCrCTAGTA 

GGGTCATAATTCTGACCCACTAGTrGAATTCTCTCXnTACCAAGAGNCAGGTATGCT^ 

C 1" r 1" 1" ICl- ICAAAACCTGrrAAATAGTAGGhm-GGGGATAtThnrAAAAATTAGGTAAAT 

GGTATATCTTCTGGTGGAAANCAGAAN 
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Ref8.2 

Sequence of BAC9 using primer C3S 10, which spans nucleotides 2412-243 1 of the cDNA. Exon 
sequence is underlined and represents nucleotides 2444-2523. 

CAQCA(K:ACnTGTTGCrGAATATTTnAO C ATGrTnr. AOGACCGr. A A ATATC-TTrf-rr.T 

GOqATGT qTAACATTrCAGG TAGGAATCrrCCAGATGTACATTAAATCAAGGTATATr 
1 n 1 1 1 1 GGTITTAGCmTCTCACrGGTGriTAGATTTXITrAGT^ 

AAA GACTTA AGCCAATGCITCAC^GGTGAATTAACATTTCACAGTGAT^^ 
TACATTTITAAGGAGTACTrCrTGTrGATTCTCITTCCACA^ 

TCAGCACrATGCTrATTTATTCrcriTGTCriTACTGCCTTGTAATCCGTrACA^ 

AACATCrATGGAAATGTATTACTGATAATCAGAArrCAGTAGAAATTCnTAATrGGCrr 
^m'AOTCACATAGCAGATNTACCAACAT^C^CTAT^CCCTACATA/^ 

ATTTTATGACTAATACCATGACrCACAGATTGAGTrrGCCCrCrAhrrAGGGTNCA^ 
TTCTGACCCACTAGTTGAATTCTCTGCITACCAAAAGTCAmTATGGCriTGC^^ 
TC^AAACCCTG^^^^AAT^AGGNACGGCmGGAGATAATITATAAAAAA^^ 
NAAAhrrGGhnTATrATTChnrCCNNGGTTGAAAAAACCCAGGAATT^ 

j^nriTTACCAAAAANGACCTCOJC^ 
„pRef9.1 

Cfi Sequence of BAC9 using primer C3S 11. which spans nucleoUdes 2679-2698 of the cDNA. Exon 
iLi. is underlined and represents nucleotides 271 1-2799. 

TTCCrATTCATgAA<KTAATCGGGATGrAAAGAAA rrATCCA CA ATT CATnnTAA^ r ^ 

TCAAGAAGCATTCAGCAAAATTnTTrATrArw -.TA Axn A-rrr-/-^ ^nTCrACCrTCACTA 

TAAAGGGAAAAAACTGTCTGAAAGCATrAATGTTGTITTGCACTGATGTCAAACrAGA 
TCCCGTGAAATGACCATTTTAATCAGACTACAAATGAGCGGTCAAAATGATAGTrCAT 
GGCCAAAGCAAAGCTCATTAACAATAAAAATGAATTCACCrAAAGTAAATGGTGATCA 
TCATAAACnrCTGCATAGCl 1 1 11 1 1 i 11 rCATnTrGAATTATrAATTAAGCAAGTTTT 

TAAAAATTGTGATITrCTGTITCACAAGGNAAGATCATAAGTrGNGGAATCrCATTnT 
AA AAATTGAT ACCCTATmCTITTGCrGNGGAAAAOTGGAAGTTITT^ 

GGi I 1 11 i 1 lAAAATm AAArGGATrGrGGAAAACCmTAAATNAATITAAAACCrAC 
CTAAAA TANi i n 1 1 1 AATGGNCCNNGCCANCTGGAACCNTTTTA 1 1 I ' l l TI CCCCTAG 
GAA TGCTTTTACCCAAA TCCATTCCCnTITGAATAATATITTTCCCm^ 
AAANi 1 1 1 iiNi 1 1 1 1 1 IGGGNGGAAAAAATANTTGGAAAATTAAAAAAATGGGGGTGG 
GGCC KTAAATG GGGATTATITITAAATXrCCTAAAAAAGGGAhnxrTCCAT^ 
NAATCCirrrrGGGNGGhm'C^IATITATrGGGGAATCCnTvJC^CIT^^ 

AAAAA^^TAGGGCC^NCCAAAATTITAAACC^^rT^AATITITNAA^ 
CCCl l lCriNGCCCGGTTGGTTT 
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ReflO.l 

Sequence of BAC8 using primer C3S12, which spans nucleotides 2908-2927 of the cDNA. Exon 
sequence is underlined and represents nucleotides 2928-2941. 

TACAGATTGGAGGT GAATGCTGTGGTCKjTTCATAAAATGTCATCrTTAnTTTnTATTrT- 

CTCTGATGATTAGACTTTCAGATCCAGATCTAATCATTTAGTAAGCCAGATCTTGCCAA 
AT AAACTACTCCGTTAG AG AATAAGG ACmTAATAGTTA CAATAATACTCTTTCAAAT 

CTTTTATGGCAGCAATAAAATAGTAATATTGTCTATTTITTGAGACTATTT^ 

ATTTTAGAAACCCCTGTATCCTTCAGAATTACrGCGACTrAACGGAGAAATATATAGTA 

TAATCCCACATTTTGTTGAAAAAGAO\AAGAATTAAGTAGTAGCTAATAj\TTGAACT 
GAACC:AGAACCCTAAGAAATTTCTGACCCAAGC\TATTATCTCTTTGGCTTy^ 

CCAGGTGAGGriUCrilAGAACGTAAAAGCCTGAAATCACACCTTAAAAACACITCCT 

TTAACCITTATAATTTCrTAATTTTCACCATAAATGATTGCGTTT^ 

TAACTAGNATTTTCTGOTATAGGTATTCTTTCCAACCTTTCrCTATTT^^ 

AGTGTAGTGGATGGACCGGAAGCATTGGGGTTCACCTGGGAGAATGGTTGGNAATGC 
|:3\GAACCCTTAGACCCCACCCCAGCCCCTGTGAAA 

i?keflO^ 

' jSequence of BAC9 using primer C3S 12, which spans nucleotides 2908-2927 of the cDNA. Exon 
'^sequence is underiined and represents nucleotides 2930-2941. 

:£ cAGATAGTAGGT GAATGCrGTGGTGGTTCATAAAATGTCAT(TTTTAnT^ 

CTGATGATTAG ACTTTCAGATCCAG ATCTAATCATTTAGTAAGCCAGATCTTGCCAAAT 

AAACTACTCCGTTAGAGAATAAGGACTTTTAATAGTTACAATAATACTCTTTCAAATCT 
^'TTTATGGCAGCAATAAAATAGTAATATTGTCTATTTTTTGAGACTATTT^ 

y TTTAGAAACCCCTGTATCCTTCAGAATTACTGCGACTTAACGGAGAAATATATAGI ATA 
3 ATCCCACATTTTGTTGAAAAAGACAAAGAATTAAGTAGTAGCTAATAATTTGAACTAG 
3 AACCAGAACCCTAAGAAATTTCTGACCCAAGCATATTATCTCnTTGGCrTAACrGGTT^ 
CAGGNGAGGTATCnTTAGAACGTNAAAGCCTGAAATCACACCITAAAAACACITNCT^ 
TAACCCTTTATAANTTNCrrAATTTTCACCCATAAAATNGATTGCNGTTT^ 
CTNGGGNCTANACCTNAGGCAATTTTrCTGGGTCATAAGGGAA A 1 1 i C i ' 1 1 ' i ' I CCCAAA 
CCCTTTTCTTCITATTTTGTTGGGNCACCTCCCAAAAGGTGKr 

NCCCCGNGAAAGGCCATTTGGGGGhTITCACCCCCGGGGGANGATrTGGTTTTGGi^^ 
TNGTC^WNAAAACCTTCC^^^ACCCCaWCCCCCNGGCCCCC^^'GNNNG^ 

AGGATCHTGNCNA' 11" [■ 14UMANCCAAANGANCNCCCCCNAGGGNGGG ATTTTNGTT AT 
TTCCCCAANAAGANGTAAGGTT^^^TGGCCIT^^IGGGGCmTGGGTGTTTTT^^^ 

Ref 11.1 

Bac9 sequenced with HC3AS3, which spans nucleotides 3376-3393 of the cDNA. Exon sequence 
is underlined and represents nucleotides 3267-3289. 
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CTGANGTGTNGCAANGCCACrCCTGTGTCITTTCrGCATGTCCTCAATAGCAAC^ 
TCGGTGTTAAG ATGATCT GAGTNANNGAGCATCTGTTAN ATC AGNGTArTG ACrr. A A A 

CTATTTAATGAACTTTATGTATAATCAACTGAAATTAGANAAAAAAAAGATCAATNGT 
AAACTTCATGTAj\CAATAAAATTCCAAACTTGGATTCTAAATGAANNAAAAANATCAA 

cctttaaagaaaagctgggggtgaataagggcttagaaaagangtanaaaatganga 
ctcaaaatggtaaagggtctaatatgnatggataaggatggacatatcttcggactct 

GAGTGGTGTACATGGCrrGATGATTGCTCACTATGTGTGNCATTATGGCTACCTCTCTT 

TAGGCATGCCTGTTAAhrrAGGAAGCrGAACTANCAAAGNCTCrrNGATGTATNANTCC 

TGCCGCrNAAGAAGGGGNCGCOTGAhWCAAATGATTTGCNATGTNTCTGCTATNATNG 

NAAGNGNTCCTNGAKnWITCNGANAAANCTCTCNANGAGNCTAGTTTACATNCGGTC 

AGNGCTTCITGCACCTCGTGNGCATCTCCCGTAhnTCACCCrCATTNNAGCN™ 

ATAANtiANNNAGCCCACtmWCCTATAGGCt^ATCtiACGCNm^ 

NAGACAATTTTTT^^^CGCCCeC^CCTN^r^CCTTCC^NNCT^ 

TCTNTNCCCCCNCCNNTTCTTANCrTNCr 

fflef 12.1 

H^pequence of BAC9 using primer C3S 15, which spans nucleotides 3750-3769 of the cDNA. Exon 
iisequence is underlined and represents nucleotides 3779-41 18, and also rejpresent the 3' terminus of 
^'jthe transcriptional unit. 

^ Jtgtgaaaagatctattggaaaacaacatggaatggaattctggaaattattattnatt 
c hgaagaatgcagtggccaagaaaatatcaaatgtagattgttaacgcttgagaatcatg 
« gctatggtttctaatgttctggtaacaagcrgttatcttrraagacattttaatga(tr 

!:^ AAAGGTACA(TrATACATTTACCATrATTTATACCATAGCTAAGGTTAAAAATTTATTrA 

CITTAAGTTCGTATTTTTTAATrTATATTACCATTTATAGATTCAT^ 
la AAATGTAGTAATGCTTATTTTAAAGGTACtATTAAATATGTGAATGTITACA CrAATTT 
g TACCGAGTGGGACTTCAAAATTTTTATTATTGACAATGGCTGAGAACAATTN 

TTGACrCNAGAACTANTTCCAAACCTAGCAGAATAAAAATCATAGATAGCCCCAAATT 

AATG AGTTTGGGNAACTGTNTCAAAG II' 111' 11 CCATTTACATACCCAAAAACAGG AA 

ATTTTAGAATTTGCO^GAACCnTTACCITAAGANAAAAACCCITTTGTGh™ 

TAhnWTAAAATTCCCGGGGGGANTAATCTTAATNACCCCGGGTGGGGCCAhfNCNCCC 

CmTATAACTTTGGAATTTAAAAATTCNTTTTTThrnslO 

NmTTTNAAGGAAAACCTTTCCACnNGGAAGTTNNCTTTTA^^ 

AAANNGGGGAANATTGGGAAGTCITCCCCITCNTTNGGGGGGNGNCCCAAAAAATTCT 
T AAT AAAANCCCCGGGGCTCCCATTNTTAGN A 1 T 1 T i ' 1 T 1 ' 11 ' 1 GGCCCCACACTGTGTT 
NATTAAANCCCCNCm'GCrAAAAATTTTTNhnsiGAAAANACCTNAACCCnT 
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HC2A 

K lAA ASGNLDKNARJTSAI YKQDSNKLS>ai)MIJai>ADFRKPEKMAKI^ TlDmSSD 

rat 

HC4 

HCl 

HC3 

HC5 • 



rat 

^jHC3 



LM rat 
K HC4 
Li^ HCl 
II I HC3 
HC5 



HC2A = 

KIAA FPIIYVNSS YIPTKQFETCSKTPI TFEVEEFVPCIPKHTOPYTIYTUHLyvyPKYLKyDSQ 

rat 

HC4 

HCl 

HC3 ' ' ' 

HC5 

HC2A VIiHHHQNPEFYDEIK 

KIAA KSrAKARNlAJCIEFKDSDEEDSQPlJCCIYGRPGGPVJTRSAFAAVT.HHHQNPEFYDEIK 



HC2A lELPTQUiEKHHLLLTFFHVSCDNSSKGSTKKHDVVETQVGYSWLPLLKIXSRVVTSEQHI 
KIAA IEIJ>TQLHEKHHlJ*LTrFHVSCJDNSSKGSTKKRDVVETQVGYSWIJ>I^^ 



G HC2A PVSAmiPSGYI^YQEI^GRHYGPEIKimXSGKPLIJCISTHLVSTVYTQDQHLHNFFQYC 

£3 KIAA PVSANIJ>SGYIX3YQ£IX^GRHYGP£IKWVIXXSKP1JJCISTHLVSTVYTC^ 

rat 

HC4 

HCl 

HC3 GPGPARSTVSISLISNSARV 

HC5 



HC2A QKTESGAOAlXSNELVKYUCSXJiAMEGHVMIAFLPTIUJQLFRVLT-RATQEEVAVN^ 

KIAA QKTESGAOALGNELVKYIJCSIJIAMEGHYMIAFLPTILNO^ 

rat 

HC4 MEIQVLIRFLSVIU4QLFWVLPNMIHEDDVPISCPMV 

HCl MSFLPIILNQLFKVLV-QNEEDEITTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSI IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 



< 

> 
> 
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HC2A 

rat 
HC4 
HCl 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

KC5 



I IH\'VACCK£^LESHlJ^,yVKYAVKA£PYVASEyK'r/HEEL 
I IHV/A:C^^i£SHLRSr/KYAYKA£?yVASEyKr'/HE£I- 



m 



TILKPSAPrLTSN 
LK?SADn?SN 



LF7iIVSKCHEEGLDSYLSSriKYSFRPGKPSAPQAPLIHETU\TMMIALU<QSADr--.:N 

LPDIVAKCHEEQLDHSVQSYIKP/FKTR ACKER PVHEDLAKN\^G LLK- SNDS rT\-^: 

TQAKDRSCNPJ^SHTETSSFXOTLTGRI-? TKKLFHEELALQK'yVCSG--SV?. Z 

Cadherir. 
Cleavage 

KLLRYSWrfFDVLI KSHAQHLI ENSKYKLI^^oSfPASYHHAAETV^^ 

KLLKYS W F FFDVL I KSMAQHL I ENS KVKLI RNQF FPAS YHHAVETWNMLMPH I TQK FRD 



KLLKYSWFFFEI 
HVLiijSWFFFAIILK 
SALW^AW FFFEIiJ^/K 



SMVHHLYFNDKLEAP RKSP 



lAKSMATYLLEENKIKLl HGQP FPKAYHHALHSLFLAIT 

KSMAQHLIDTNKIQLE RPQP FPESYQNELDNLVMVLSDHVIWKYKD 



- IVESOYAE 
DHVIWKYKD 
FPERmDDIAALVSTIASDIVSRFQK 



HC2A 
KIAA 
rat 
HC4 

HCl 
HC3 

HCr 





HC2A 




KIAA 




rat 




HC4 




HCl 




HC3 


ij,; S 


HC5 


ru 


HC2A 


f ~ 


KIAA 


i - ; 


rat 



HC4 
HCl 
HC3 
HC5 



NPEASKHANHSLAVFIKRCFTFM)RGFSrFKQIN-^ — NYIS — CFAPGDJ^TLFEYKFEFL 
NPEASKNANHSIAVFXKRCFTniDRGPirFKQIN NYI S — CF3\PGDPKTJ>FEYKFEFL 

I PKESRMVNYSLASFLKCCLTLMDRGFSrFNLIN DY IS — GFSPKDPKVLAEYKFEFL 

ALEETRRATHSYARFLl^CFT IMDRGCVFKMVN NYI S — MFSSGDLKTLCQYKFDFL I 

DTEMVERLNTSIAFFl^Ll^VMDRGFVFSLIKSCYKQVSSKLYSLPHPSV^ 3i 1/3 



RWCHHEHY I PLNLPM- 
KWCNHEH Y IPLNLPM— 



■--PrGKGRIQR- 
— PFGKGRIQR- 



YQDLQL— 
YQDLCL— 



— DYSLrrEF 
— DYSL7--EF 



OTICN'HEHYI PLNLrM AFAKPKLCR VQi>£KL EYSL5I EV 

OEVCCHEHFIPLCLPIRSANIPDPLTPSES-= TQELHASDMPEYSVTNE F 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATfeQSSGFSTNVQDQKIANMFELS— VPF 
MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 



Cadherin 



EC 

CRNHFtVGL 
CRNHFX.VGL 


: mot 

LLRE 
LLRE 


VGTALQE FRE VRL I A I SVLKNLL I KKS 5T)DR YASHSHQAR I AT 

VGTALQEFRE ^.VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 


CKHHFLVGL 
CRKHFLBGI 
RQQHYLAGI 
SSTS-SPGL 


LLRE 
LLRE 
^TE 
LFTE 


TSIALQDMYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

VG FAirOEOQD VRHIALAVLKNLMAKHS FDDRYREPRKQAQI AS 

LAVXLDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEG I S EVQRKAVS AI HS LLS SHDLDPRCVKPEVKVK I AA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



LYLPLFGLLIENVQRINVRIWSPFPVHAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 
LYLPLFGLLIENVOR INVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPOKGS TLDNSLH 

LYLPrVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG — FTSP~AN~RGSLS 

LYMPLYaaLDNMPRIYIJCDLYPFTVNTSNcfeSRDDLSTNGGFQSQTAIKHANSV^ 

LYLPLIGIIMETVPQLYDFTETHKORGRPICIATPDYESE SG 5MI5 

L YLPLVG 1 1 LDALPQLCDFTVADTRRYR TSGS DEEQE GA GAI T 



B 
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HC^A KDLLG.^ISG4=^FYTTi7='N:N5%TLNADSRGSLISTDSGNSLPERNSEKSNSLDMnC'QS5 

KIAA KDLLGAISGJ^PYT-->"PS':sV-VRKAn5;RGSLISTnsr,N5;i.PrPUgrk'<;>j<:T ntrjr^nQc: 

rat 

HC4 TDKDTAYGsUR HG I KRZDSRGSL I P-EGATGFPeJ^G EN TRQS 

HCl KDVLNSi;>^FSS lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKI PRPL ^ n 

HC3 QTVAMAIAGTSVPQ LTRPGSFLLT^SGRQHT S*'/^'^ 

HC5 QNVALAIAGNNFN LKTSG-IVLS3LPYKQYN 



HC2A TLGNSV'\TlCDKLDQS£::<SLI_ycrLyilK5MSDDALn'YTO-KASTSEI>IDFrTIS£VC^ 

KIAA TLGKSVVRCDKLDQS£IKSU><CFl.yilJCSMSDDALrry>OI-K^ 

rat 

HC4 STRSSVSQYNRLDQYEIP^LU^CYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HCl ALIGSTLRrDRLIX5AETRSLl>ICrUiIMKTISYETLIAYW0-RAPSPEVSDFFSIlfc'V^ 

HC3 : T FS AESSRS LLI CLLWVLKN- ADETVLQKW FTDLS VLQLNRLLDLiJy LCV 

HC5 MLNAPTTRNIJ4ICFLWIMKN-ADQSLIRKHIADLPSTQIJTOILDLLriCV 

HC2A HQEjbYMGKRYIARNQEGLG — PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQYMGKRYIAR ^ -TGMM 

rat 



HC4 FHFRYMGKRNIARVHDAWLSKHroiDRKS QTMPALRNRSGVM 

HCl QNFRYLGKRNIIRKIAAAF — KF^QSTQiWGTIJ^GSNPSCCrrSGLlAQHMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK — KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTCVUJ — KSRDVKAR LEEALLRGEGARGEMM 

HC2A HAKLQCL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

:™ KIAA KARLQQL QSLDHS LTFNHSYGHSDADVLHQSLLEANIATEVC 

'^^ rat . i : 

HC4 CARLQHL 55LE5S FTLNHSSTTTEADIFH0ALLEGNTATEV5 

"'-J HCl :»hrsotl?i:rgk — nalsnpkl lc^ildntmtIsnsnzidivhhvdteaniategc 

rU HC3 RRSRGQl^RSPSGSAFGSGENIJ>JmKrWTHWRQNTEKLDKSRAEIEHEALIDGNIATE^ ^•//r*^ 

HC5 RRRAPGNDRFP GLNENXJOOOCEQTHWROANEKLDKTKAELDQEALISGNIATEAH 

m 

HC2A LTAIJDTI^LFTIAFI^CI-IADHGHNPUIKKVFI^^ 

KIAA LTAIJDTLSLFTIAFKNCIJAPHGHNPIMKKVFIJWI^ 

|- rat KI^RGHSPI>IKKVFimu:FLQKHOSEMALKNVrrALRSLIY 

r: HC4 ITVLDTISFFTQCFKTHrLNNDGKNPI^IKKVFDIHIAFUC^ 

HC: X-TILDLVSLFTQTHQRCLQQCDCQNSlWKRGrDTYMLFFQVNQSATALKHVFASLRLrVC 

□ HC3 LIILDTLEIWQTVS--'/TES--KESILGGVLKVLLHSMACNQSAVyLQHCFATQRAI-VS 
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ile ser arg thr val ala ala glu val arg 
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CCC CAA CTG CTC AAA AAC CTT AAT ATT GTT 
pro gin leu leu lys asn leu asn ile val 
151/51 

CTT ACC GAA GCA GTA GAT CCA GTG GAT TTG 
leu thr glu ala val asp pro val asp leu 
211/71 

GTG GAT TCT GGG CCT TTA CGG GAT TTG ATT 
val asp ser gly pro leu arg asp leu ile 
271/91 

TAT AGT CCT CGG GAC TGC AGA ACT CTT GTT 
tyr ser pro arg asp cys arg thr leu val 
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CCA CAT GTT AGA GAC TGT ATA AGA AGT TAT 
pro his val arg asp cys ile arg ser tyr 
391/131 

TAT CAT AAA TTG GGA ACA GGA TTT AAT CCC 
tyr his lys leu gly thr gly phe asn pro 
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AAA GGT TTG CCA AAA CAA GTT TTT GAA TCT 
lys gly leu pro lys gin val phe glu ser 
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GAT GAT CAA GAT GAC CTT AAA AGA CGT TCA 
asp asp gin asp asp leu lys arg arg ser 
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trp ala cys ser ile phe asp leu lys asn 
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gly leu leu arg pro his val pro pro ala 
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TTT TCT ATC ACT TAT CCT TCC CAA GAT GTT 
phe ser ile thr tyr pro ser gin asp val 
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asn val glu val val ala val ser ser ile his thr gin asp pro tyr leu asp lys phe- 
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ile met glu asn asn leu glu asn glu leu lys ser ser ile ser ala leu asn ser ser 
2341/781 2371/791 

CAG CTG GAA CCA GTG GTC CGA TTT CTT CAT CTT CTG CTA GAT AAA CTG ATA CTT TTA GTT 
gin leu glu pro val val arg phe leu his leu leu leu asp lys leu ile leu leu val 
2401/801 2431/811 

ATT AGA CCT CCT GTC ATT GCT GGC CAA ATA GTT AAC CTA GGT CAA GCA TCT TTT GAA GCC 
ile arg pro pro val ile ala gly gin ile val asn leu gly gin ala ser phe glu ala 
2461/821 2491/831 

ATG GCA TCA ATT ATA AAT CGA CTT CAC AAA AAC TTG GAA GGA AAT CAT GAC CAG CAT GGC 
met ala ser ile ile asn arg leu his lys asn leu glu gly asn his asp gin his olv 
2521/841 2551/851 

AGA AAC AGC CTT CTT GCA TCA TAT ATT CAT TAT GTT TTC CGC CTA CCA AAT ACT TAC CCT 
arg asn ser leu leu ala ser tyr ile his tyr val phe arg leu pro asn thr tyr pro 
2581/861 2611/871 

AAT TCA TCA TCA CCA GGT CCT GGG GGT TTG GGA GGA TCA GTG CAT TAT GCC ACA ATG GCT 
asn ser ser ser pro gly pro gly gly leu gly gly ser val his tyr ala thr met ala 
2641/881 2671/891 

AGA TCT GCG GTG AGA CCT GCA AGC CTT AAT TTA AAT CGT TCT CGA AGC CTT AGT AAT AGC 
arg ser ala val arg pro ala ser leu asn leu asn arg ser arg ser leu ser asn ser 
2701/901 2731/911 

AAT CCA GAT ATA TCT GGG ACT CCC ACG TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG 
asn pro asp ile ser gly thr pro thr ser pro asp asp glu val arg ser ile ile qlv 
2761/921 2791/931 

AGT AAG GGT TTA GAT CGC TCC AAT TCC TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA 
ser lys gly leu asp arg ser asn ser trp val asn thr gly gly pro lys ala ala pro 
2821/941 2851/951 

TGG GGA TCC AAC CCC AGT CCA AGT GCA GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT 
trp gly ser asn pro ser pro ser ala glu ser thr gin ala met asp arg ser cys asn 
2881/961 2911/971 

CGT ATG TCT TCG CAC ACA GAG ACG TCA AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA 
arg met ser ser his thr glu thr ser ser phe leu gin thr leu thr gly arg leu oro 
2941/981 2971/991 

ACT AAA AAG CTT TTT CAC GAG GAG CTG GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT 
thr lys lys leu phe his glu glu leu ala leu gin trp val val cys ser gly ser val 
3001/1001 3031/1011 

CGG GAA TCA GCT TTG CAA CAA GCC TGG TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG 
arg glu ser ala leu gin gin ala trp phe phe phe glu leu met val lys ser met val 
3061/1021 3091/1031 

CAC CAT TTA TAC TTT AAT GAT AAA CTT GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT 
his his leu tyr phe asn asp lys leu glu ala pro arg lys ser arg phe pro olu aro 
3121/1041 3151/1051 ^ ^ ^ M g 

TTC ATG GAT GAC ATT GCA GCT CTT GTC AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT 
phe met asp asp ile ala ala leu val ser thr ile ala ser asp ile val ser arg phe 
3181/1061 3211/1071 

CAG AAG GAC ACA GAA ATG GTT GAG AGA CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT 
gin lys asp thr glu met val glu arg leu asn thr ser leu ala phe phe leu asn asp 
3241/1081 3271/1091 

CTG TTG TCT GTT ATG GAC AGA GGA TTT GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG 
leu leu ser val met asp arg gly phe val phe ser leu ile lys ser cys tyr lys gin 
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3301/1101 ^ 3331/1111 

GTG TCT TCA AAG CTT TAC TCA TTA CCG AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG GAT 
val ser ser lys leu tyr ser leu pro asn pro ser val leu val ser leu arg leu asp- 
3361/1121 3391/1131 

TTT CTA CGA ATC ATC TGC AGT CAT GAG CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA 
phe leu arg ile ile cys ser his glu his tyr val thr leu asn leu pro cys ser leu 
3421/1141 3451/1151 

CTT ACT CCA CCT GCA TCT CCA TCA CCT TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA 
leu thr pro pro ala ser pro ser pro ser val ser ser ala thr ser gin ser ser gly 
3481/1161 3511/1171 

TTT TCT ACG AAT GTA CAA GAC CAA AAG ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC 
phe ser thr asn val gin asp gin lys ile ala asn miet phe glu leu ser val pro phe 
3541/1181 3571/1191 

CGC CAA CAG CAT TAT TTG GCA GGA CTT GTG TTA ACA GAG CTG GCT GTC ATT TTA GAC CCT 
arg gin gin his tyr leu ala gly leu val leu thr glu leu ala val ile leu asp pro 
3601/1201 3631/1211 

GAT GCT GAA GGA CTG TTT GGA TTG CAT AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC 
asp ala glu gly leu phe gly leu his lys lya val ile asn met val his asn leu leu 
3661/1221 3691/1231 

TCC AGT CAC GAC TCA GAC CCG CGG TAC TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG 
ser ser his asp ser asp pro arg tyr ser asp pro gin ile lys ala arg val ala met 
3721/1241 3751/1251 
^TTG TAT CTA CCT CTG ATT GGT ATT ATC ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA 
^leu tyr leu pro leu ile gly ile ile met glu thr val pro gin leu tyr asp phe thr 
13781/1261 3811/1271 

I GAA ACT CAC AAT CAA CGA GGA AGA CCA ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG 
^glu thr his asn gin arg gly arg pro ile cys ile ala thr asp asp tyr glu ser glu 
3841/1281 3871/1291 

AGC GGA AGT ATG ATA AGC CAG ACC GTT GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA 
ser gly ser met ile ser gin thr val ala met ala ile ala gly thr ser val pro gin 
3901/1301 3931/1311 

CTA ACA AGG CCT GGC AGT TTC CTC CTC ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC TTT 
leu thr arg pro gly ser phe leu leu thr ser thr ser gly arg gin his thr thr phe 
3961/1321 3991/1331 

TCA GCA GAA TCA AGT CGA AGC CTT TTG ATC TGT CTA CTT TGG GTT CTC AAA -AAT GCA GAT 
ser ala glu ser ser arg ser leu leu ile cys leu leu trp val leu lys asn ala asp 
4021/1341 4051/1351 

GAA ACA GTT CTA CAG AAG TGG TTT ACA GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA 
glu thr val leu gin lys trp phe thr asp leu ser val leu gin leu asn arg leu leu 
4081/1361 4111/1371 

GAT CTG CTT TAT CTC TGT GTG TCT TGC TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA 
asp leu leu tyr leu cys val ser cys phe glu tyr lys gly lys lys val phe glu arg 
4141/1381 4171/1391 

ATG AAT AGC TTG ACC TTT AAG AAA TCA AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT 
met asn ser leu thr phe lys lys ser lya asp met arg ala lys leu glu glu ala ile 
4201/1401 4231/1411 

CTT GGG AGC ATA GGT GCC AGG CAA GAA ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA 
leu gly ser ile gly ala arg gin glu met val arg arg ser arg gly gin leu glu arg 
4261/1421 4291/1431 

AGC CCA TCT GGA AGT GCC TTT GGA AGT CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT 
ser pro ser gly ser ala phe gly ser gin glu asn leu arg trp arg lys asp met thr 
4321/1441 4351/1451 

CAC TGG CGT CAA AAC ACA GAG AAG CTT GAC AAA TCA AGA GCA GAG ATT GAA CAC GAA GCA 
his trp arg gin asn thr glu lys leu asp lys ser arg ala glu ile glu his glu ala 
4381/1461 4411/1471 

CTG ATT GAT GGA AAC CTG GCT ACA GAA GCA AAC CTA ATC ATT TTA GAT ACA TTA GAG ATT 
leu ile asp gly asn leu ala thr glu ala asn leu ile ile leu asp thr leu glu ile 
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4441/1481 
















GTT GTT CAG 


ACC 


GTT 


TCT 


GTA 


ACG 


GAA 


TCC 


val val gin 


thr 


val 


ser 


val 


thr 


glu 


ser 


4501/1501 
















GTG CTA CTA 


CAC 


AGC 


ATG 


GCC 


TGT 


AAC 


CAA 


val leu leu 


his 


ser 


met 


ala 


cys 


asn 


gin 


4561/1521 
















ACA CAG AGA 


GCC 


TTG 


GTT 


TCA 


AAG 


TTT 


CCT 


thr gin arg 


ala 


leu 


val 


ser 


lys 


phe 


pro 


4621/1541 
















TGT GCT GAT 


TTA 


TGC 


CTC 


AGG 


CTT 


CTC 


CGA 


cys ala asp 


leu 


cys 


leu 


arg 


leu 


leu 


arg 


4681/1561 
















TCA CAC CCC 


AGT 


GCC 


TCC 


CTT 


TAC 


CTA 


CTA 


ser his pro 


ser 


ala 


ser 


leu 


tyr 


leu 


leu 


4741/1581 
















TTT GCC AGG 


GTT 


AAA 


ATG 


CAG 


GTA 


CCA" 


ATG 


phe ala arg 


val 


lys 


met 


gin 


val 


pro 


met 


4801/1601 
















AAT TTT AAT 


GAA 


GAA 


TTC 


TTA 


AGA 


CGT 


TCT 


asn phe asn 


glu 


glu 


phe 


leu 


arg 


arg 


ser 


4861/1621 
















GAT CTG GAA 


TTG 


AGG 


GAA 


ACA 


ACA 


TTT 


CCT 


asp leu glu 


leu 


arg 


glu 


thr 


thr 


phe 


pro 


4921/1641 
















CAT ATG ATT 


CTT 


TCT 


GAT 


ACT 


GTG 


AAA 


ATG 


his met ile 


leu 


ser 


asp 


thr 


val 


lys 


met 


4981/1661 
















ATT GAT CTA 


ATG 


TAC 


AGA 


ATT 


GCC 


AAG 


GGT 


ile asp leu 


met 


tyr 


arg 


ile 


ala 


lys 


gly 


5041/1681 
















TTG CAG AAC 


ATG 


GCA 


GGC 


AAG 


CAC 


TCA 


GAA 


leu gin asn 


met 


ala 


gly 


lys 


his 


ser 


glu 


5101/1701 


- 














CTA GTC CAC 


TCA 


GCA 


GCA 


CTT 


GTT 


GCT 


GAA 


leu val his 


ser 


ala 


ala 


leu 


val 


ala 


glu 


5161/1721 
















CTT CCT GTG 


GGA 


TGT 


GTA 


ACA 


TTT 


CAG 


AAT 


leu pro val 


gly 


cys 


val 


thr 


phe 


gin 


asn 


5221/1741 
















GTC TCA GAT 


GAT 


GTG 


GTA 


TCT 


CCA 


GAT 


GAA 


val ser asp 


asp 


val 


val 


ser 


pro 


asp 


glu 


5281/1761 
















GAG TCA GGA 


CTT 


GTG 


GGA 


TTA 


CTG 


GAA 


CAA 


glu ser gly 


leu 


val 


gly 


leu 


leu 


glu 


gin 


5341/1781 
















TAT GAA GCA 


GTT 


AAT 


GAA 


GTT 


TAC 


AAA 


GTA 


tyr glu ala 


val 


asn 


glu 


val 


tyr 


lys 


val 


5401/1801 
















GCA AAG AAA 


CTA 


TCC 


ACA 


ATT 


CAT 


GGT 


AAA 


ala lys lys 


leu 


ser 


thr 


ile 


his 


gly 


lys 


5461/1821 
















CAG AGT ACT 


GGC 


TGG 


GAG 


CGG 


ATG 


TTT 


GGC 


gin ser thr 


gly 


trp 


glu 


arg 


met 


phe 


gly 


5521/1841 
















AAG TTC GGG 


GAT 


TTG 


GAT 


GAA 


CAA 


GAA 


TTT 


lys phe gly 


asp 


leu 


asp 


glu 


gin 


glu 


phe 



4471/1491 

AAA GAG AGC ATT CTT GGT GGA GTG CTA AAA 
lys glu ser ile leu gly gly val leu lys 
4531/1511 

AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT 
ser ala val tyr leu gin his cys phe ala 
4591/1531 

GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG 
glu leu leu phe glu glu glu thr glu gin 
4651/1551 

CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG 
his cys ser ser ser ile gly thr ile arg 
4711/1571 

ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC 
met arg gin asn phe glu ile gly asn asn 
4771/1591 

TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG 
ser leu ser ser leu val gly thr ser gin 
4831/1611 

CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA 
leu lys thr ile leu thr tyr ala glu glu 
4891/1631 

GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC 
asp gin val gin asp leu val phe asn leu 
4951/1651 

AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG 
lys glu his gin glu asp pro glu met leu 
5011/1671 

TAC CAG ACC TCT CCA GAG CGA TTG ACC TGG 
tyr gin thr ser pro glu arg leu thr trp 
5071/1691 

CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT 
arg ser asn his ala glu ala ala gin cys 
5131/1711 

TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT 
tyr leu ser met leu glu asp arg lys tyr 
5191/1731 

ATT TCA TCT AAT GTT TTA GAA GAA TCT GCG 
ile ser ser asn val leu glu glu ser ala 
5251/1751 

GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT 
glu gly ile cys ser gly lys tyr phe thr 
5311/1771 

GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG 
ala ala ala ser phe ser met ala gly met 
5371/1791 

CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT 
leu ile pro ile his glu ala asn arg asp 
5431/1811 

CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT 
leu gin glu ala phe ser lys ile val his 
5491/1831 

ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC 
thr tyr phe arg val gly phe tyr gly thr 
5551/1851 

GTT TAC AAG GAG CCT GCA ATA ACC AAA CTT 
val tyr lys glu pro ala ile thr lys leu 
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5581/1861 
















5611/1871 
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5641/1881 
















5671/1891 
















GAA GTA ATC 


AAA 


GAC 


TCT 


AAT 


CCT 


GTA 


GAC 


AAG TGT AAA 


TTA 


GAT 


CCT 


AAC 


AAG 


GCA 


TAT 


glu val ile 


lys 


asp 


ser 


asn 


pro 


val 


asp 


lys cys lys 


leu 


asp 


pro 


asn 


lys 


ala 


tyr 


5701/1901 
















5731/1911 
















ATT CAG ATT 


ACC 


TAT 


GTG 


GAG 


CCA 


TAC 


TTT 


GAC ACA TAT 


GAG 


ATG 


AAG 


GAC 


AGA 


ATC 


ACC 


ile gin ile 


thr 


tyr 


val 


glu 


pro 


tyr 


phe 


asp thr tyr 


glu 


met 


lys 


asp 


arg 


ile 


thr 


5761/1921 
















5791/1931 
















TAT TTC GAC 


AAA 


AAT 


TAC 


AAT 


CTT 


CGT 


CGA 


TTC ATG TAC 


TGT 


ACA 


CCC 


TTT 


ACT 


TTA 


GAT 


tyr phe asp 


lys 


asn 


tyr 


asn 


leu 


arg 


arg 


phe met tyr 


cys 


thr 


pro 


phe 


thr 


leu 


asp 


5821/1941 
















5851/1951 




• 












GGC CGT GCC 


CAT 


GGG 


GAA 


CTT 


CAT 


GAA 


CAA 


TTC AAA AGG 


AAG 
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ATT 


CTG 
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ACG 
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gly arg ala 
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gly 
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phe lys arg 
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ile 
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thr 


thr 


ser 


5881/1961 
















5911/1971 
















CAT GCC TTT 


CCT 


TAT 


ATT 


AAA 


ACA 


AGG 


GTC 


AAT GTC ACT 


CAT 
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GAA 


GAG 


ATC 


ATC 


TTA 


his ala phe 


pro 


tyr 


ile 


lys 


thr 


arg 
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his 


lys 


glu 


glu 


ile 


ile 


leu 


5941/1981 
















5971/1991 
















ACA CCA ATT 


GAA 


GTT 


GCT 


ATT 


GAG 


GAC 


ATG 


CAG AAA AAG 


ACA 


CAG 


GAG 


TTG 


GCA 


TTT 


GCA 


thr pro ile 


glu 


val 


ala 


ile 


glu 


asp 


met 


gin lys lys 


thr 


gin 


glu 


leu 


ala 


phe 


ala 


6001/2001 
















6031/2011 


* 














ACA CAT CAG 


GAT 


CCC 


GCA 


GAC 


CCC 


AAA 


ATG 


CTT CAG ATG 


GTA 


CTC 


CAG 


GGA 


TCT 


GTA 


GGC 


,,,^thr his gin 
y 6061/2021 


asp 


pro 


ala 


asp 


pro 


lys 


met 


leu gin met 


val 


leu gin 


gly 


ser 


val 


gly 
















6091/2031 
















Jj^ACC ACA GTG 


AAT 


CAG 


GGG 


CCT 


TTG 


GAA 


GTT 


GCC CAG GTT 


TTT 


CTG 


TCT 


GAA 


ATA 


CCT 


AGT 


^~^thr thr val 


asn 


gin 


gly 


pro 
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glu 


val 


ala gin val 


phe 
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ser 


glu 


ile 


pro 


ser 


W 6121/2041 
















6151/2051 
















MGAC CCA AAG 


CTC 


TTC 


AGA 


CAT 


CAT 


AAT 


AAA 


CTG CGA CTC 


TGC 


TTT 


AAA 
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TTT 


ACT 


AAA 


rU asp pro lys 


leu 


phe 


arg 


his 


his 


asn 


lys 


leu arg leu 


cys 


phe 


lys 


asp phe 


thr 


lys 


6181/2061 
















6211/2071 
















CRAGG TGT GAA 


GAT 


GCC 


TTA 


AGA 


AAA 


AAT 


AAG 


AGC TTA ATT 


GGG 


CCG 


GTT 


CAA 


AAG 


GAG 


TAT 


arg cys glu 


asp 


ala 


leu 


arg 


lys 


asn 


lys 


ser leu ile 


gly 


pro 


val 


gin 


lys 


glu 


tyr 


L 6241/2081 
















6271/2091 
















l.^ CAA AGG GAA 


TTG 


GGG 


AAA 


CTA 


TCT 


TCG 


CCT 


TAA 
















l^Z gin arg glu 


leu 


gly 


lys 


leu 


ser 


ser 


pro 


OCH 

















C3 AGAGGCCCTACAGCCCTAGATCACAGAAAGTCCCTCAGTTATCCAAGCCAGTATTGCTTGTCCCCTGCCACAGAGATTCC 
£3 TTCAGTCGAATGAGCTTTCGCAAAATGGATCTCTAAACTGAATGCACTTGTTTTATTCATCTGCAAAGAGCCATGTATTC 
AACATCGAGTGTGAAAAGATCTATTGGAAACCAACATGGAATGGAATTCTGGAAATTATTATTCATTGAAGAATGCAGTG 
GCCAAGAAAATATCAAATGTAGATTGTTAACGCTTGAGAATCATGGCTATGGTTTCTAATGTTCGGGTAACAAGCTGTTA 
TCTTTTAAGACATTTTAATGACTCAAAGGTACACTATACATTTACCATTATTTATACCATAGCTAAGGTTAAAAATTTAT 
TCACTTTAAGTTCGTATTTTTTAATTTATATCACCATTTATAGATTCATTTTGGACCCATTTTAAATGTAGTAATGCTTA 
TTTTAAAGGTACTAAAAAATATGTGAATGTTTACCTCGTGCGCGCCAGGGCCTC 
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A. Allelic Variations: single nucleotide changes (polymorphisms) between hCLASP-3 
cDNAs and/or genomic sequences 



isoform 


Difference 


Nucleotide 
position 


Consequence 


1 


polymorphism 


318 


A to G; missense 


2 


polymorphism 


323 


A to G; Glu to Gly 


3 


polymorphism 


2187 


T to C; missense 


4 


polymorphism 


3165 


T to G; Asp to Glu 










B. Alternative Exon usage 


Isoform 


Difference 


Nucleotide 
position 


Consequence 


1 


Exon deletion 


2768-2860, 
inclusive 


In frame deletion — 33 amino 
acids removed 











These differences may be found separately or together in various combinations in the 
differenct human CLASP-3 isoforms 
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Exon 60270 - 20370 

rfirnCCTTCfiTCCAGAAGATCAGCAG GTAAATATCCGGCGTGGGGCGC 



Exon 85360-85510 

r-.TTTTxr^TTTfnT'rATTrTTAfl AArflGTGGCAGCCGAAGTTA GGAAGCAGATC 
TrCGGACAATATAGTGGTTCTCCCCAACTGCTCAAAA ACCTTAATATTGTTGG 
rAATATATCCCATCACACCACAGTA AGTAACGTATTCAAAATATA 



Exon 94500 - 94720 

TCTTAT<"'"<" A A rTTTTT A r A A AnTT TGCCCCTTACCn A AGCAGTAGATCC AGTG 
rrATTTGGAAGATTACCTCATTACTCATCCTTTGGCTG TnrTATTCTGGGCCTTT 

A rfiTTGATTTnATTGAATTTCCTCCAGATGATATTGAAGTTGTTTATAGTCCTC 
nriOACTGCAGAACTCTTGTTTCAGCTGT ACrTGAAGAAAGGTAAGGAGACAT 

TGACTTATT 



Exon 94870 - 94980 

TATTTTr rTTTTTA A a ATA GTGAAATGG ATCC ACATGTTAGAGACTGTATAAG 
AAfiTTATACAGAAGACTGGGCAATTGTnATCAGAAAG TAAGTTATATGTTTA 

TTACAA 

Exon 100110- 100290 



ATTTATTTAAt' i I I I I I KM 1 I AATAG ATATCATA A ATTGGGAACAGGATTTAA 

TrrrAATACATTAGATAAAC AGAAAGAAAGGCAAAAAGOTTTGCCAAAACA 
AGTrTTTGAATCTGATGAAGCTCCAGATG GCAACAGCTACCAGGATGATCAA 

fiTAATACmTATTCITAAATAA 



Exon 100340- 100600 

ATATTTAATr.TTTTGrATnArAn nATGACCTTAAA AGACGTTCAATGTCAATA 

GATGATACCrrAAGGGGTAGCTGGGCCTGTAGTATC TTTGACTTGAAAAATT 

rACTTCCTGATGCTTTGCTTCCCAATTTACTTGATC GAACTCCAAATGAAGAA 

ATAGACCGTrAGAATGATGACCAAAGG AAATCAAACCGTCACAAAGAACnT 

TTGTTTTGCATCCATCACCAGATGAGGT ATAGATGTTTGCATATAAAGAA 
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Exon 100880-101020 

TTTTGGTGTTGCITITCAATTTGTAGGAAQAACCAM^^ 
rrTGATATACCCAAAGAACATTTTGGTCAAAGACTTCTTGTAAAA TGCTTATC 

ACTCAAGTG AGTAnTAnrCTnTACTTACAACT 
Exon 112010-112120 

TTTTTrTTrATAAAGG TrTGAAATTGAAATTGAACCCATITTTGCAAGTTTG^ 
fmTATATGATGTCAAGGAAAAGAAAAAGG TAAGArrATATAATTTGACCAT 

AGTTAT 

Exon 113680-113880 

A A r^TTT A Ar ATAPTA ATATTTTTTA GATTTCAGAAAA iTrTTTATTTrGACCTTA 
ATTCTGAGCAGATGAAAGGGTTGTTACGTCCACATG TACCACCTGCTGCCATT 
ArTACCCTGGCAAGATCAGCAATTrTTTCTATCACTTATCCrrCCCAAGAXC^ 
i T 1 1 C I T GTAATAAAGG TGAGAATAATGTTAAATATATTTG 



Exon 115020-115160 

TTAAT«^AArTTTTTTTr^.rTTTfiACAG rTAGAAAAAGTCCTACA 

nACATTGGAGAGTGTGCAGAACCATATATGATTTTCAAAGAAGCAGATGCCA 

CCAAGGTAGAATGTTATGCTTCTCATTrCCGCCAC 



Exon 117200- 117410 

AT<"-T ATA A AnTTrTOTTTTGGAG AATAAAGAAAAACTGGAGAAACTGAAGAG 
TCAAGCAGATCAGTTTTGCCAAAGACTTGGGAAATATCGCATGC CTTTTGCTT 
firiArTTGCAATCCATITAATGAATATTGT TAGCAGTGCTGGGAGTTTGGAAAG 
ArTATTCTACAGAAGTAGAA ATCAGTACTGGAGGTAAGAGTGnTCATACAAA 

AC 



Exon 123200 - 123396 

AAA AT^^ A Tl - n i i rn \ a ATTrTTTTGTA GAACGAAAAGGGTCTTGGTCAGA 
GAGGAGGAATTCTAGTATTGTTGGCAGACGATCACTTGAAAGGACAACAAGT 

GGAGATGATGCrTGTAACTTGACGAGC TTTCGACCAQ CTACT CTCACAGTGAC 
AAATTTTTTTAAGCAGG TATTGTTCTGTCATGTAGGAATnT 



A 
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GAAGGCCATCTTCTGTCTTACGGCGACTAAGACCTATTACAG) 
Exon 5560-5710 

rrmTCCTCTATTATT^'^ ^ Axr ah TtA AfiGAGACC G TTTAAGTGATGAAGATC 
T rTArAAATTrrTTGrTGATATGAGAAGGCr ATrnTrTGTCTTArriGCGACTA 
>V,^rrT ATTACAGG TATTTAAAAATTTTGAGTAGAAATGGTrGCA 

Exon 6680-6900 

TTACATrGTrmTAATATAT^ ATTTnr AGTTr AGCTCA AGATAGACAnTCTC 
^nr^ArrTGAAA ATrrrrATTATTGr(^AACTCr.CK3A^^^^ 

A /^TfTTTAGAGTTTCCCG 

AGAATTCTG 
Exon 38920 - 39075 

GTAnTACTATCATCTATACTGTTGCTTTCACAGA A ATrTTrTCTACATATACC 

Sry^AT!;^^^ ATTTTGrrAATPfiTrA AGGTrC -rr-^TA^-^ ^ ATATAACAGTG 
AAAGTCCAr.TrrATGTATr^TA 




A 
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GTAAGCTGCAGAGTACATGCAAAGTCCTTTCAGACACTTTGGCAACTAGTGAGTCATGT 
ATACAAAGAACTATTCAAAGCAAAATGTGGCCAGTTCTCTAAGAGAGTAGTGAAAAGAT 
GTTTGGAGAACAGAGATAAAGGCATGCAACATCTGGCTGGGAGTGTGAGAGCAGACAAC 
CAATTCAGGGACTATCAAACACCCAGGACAAGCTTAAGTACAGGATGAATAATGACAGT 
AACAGTGATAATAATAACAACAGCTGACACACATCGTACTTACCATGAACCAGGCCCTG 
TTCTAAGAACTTTATATTAATCAATTCAATT^TCTTCACAACACCCTTTCAAGTAGAT 
ATAATTATTTTCCTCCTTTTAGAGATGAAGTTAAGTAGCTTGCCTAAGGTTACAGAGTA 
AGAAAGTGGCAAAATAAGTCTGGGTTCCTATCACAGAGGGTCTTAAAAGCCAGGTACAG 
TGATAGCCTTGACTTACAGTCTGTGAGTACCTGGTTCATCTTTCCCAATGGACTCAAAC 
ATTCTGAGAGCAGGTCTGTGTCTAGTTCACCTTCGTGTTTTTTAGAACACCTAGGTGGG 
CTCTATACTTAGGTAGAAGCCAAATATCTGTCCTCTTTGAATAGACTATCTTTTCCTTT 
AGTAAACAAGCATCTGAAAACAGTGTGTTACTGGAGGGCGCCTTGGCAAGAGCCTGTGG 
TACTAGCTACTCCTCCAGAGGCTGAGGCAAGAGGATTGCTTGAGCCTAGGAGTTTTAGG 
CTGCAGTGCCCTATGCATTCCATGCTTGAGCAACACAGGGGGATTCTGTCTAAAATAAA 
TAAATCAATAAATAAAAGCAGTGTGTTACTGCAATGAGCATTGTTATATAAATACACAG 
TCTTCCACAGAAGTTACAGCACTAGACAGGAAGTTATGGATCTGACTAATTGAGGCTTT 
O CACCTCTGACTCAGTGTGTGACTCTGGCAAGTGACCTAATGGCTCCAAGCTACAGATTT 
V4 TCCAATGGTAAAAGAGAGGAAAACAGGTATCCAACATATTCCACCAGGACATACTGAGA 
ATATAATAATAACAATAAATAAATACACATTTTAAGTTCTTGCAATACTTAAGAGGAAA 
GGTACTACAAGAAACCAAGATATTTTTCTTAGTGTCACCATAATTTCAGTATCAACAGA 
^ TATGCATTAGGAGCCAAATATTCTAGACATTATGGGATTACAACAGAACAGATGAAAAC 
AAATCTCTGCTAGTCTCTGTTTTATAGGCTATTATGGACAAGGTCAGTTTCAGTGGTCA 
cS ACACAGAGCTTATTTAACAGAACATCAAAATGGGGGTGAACGTTAGAGCTCACCTGGAT 
r TTTATATATATATATATATATATATATATATATATATATATATGTATGAGAGACAGAGA 
H= GAGATCCAAAAGGGGTTAATAATACGGAAACAACTGTCCTCAAAGAAAAGAACCGGCTC 
fU TGGGTTTGAGTCTCAGTGGAAGAGTTTACCATCTGTGCGACCTGGGGCCAGTCACTTAG 
|- CTTTTCCAAGCCAATTTCCTTCCCTCTAAAATAGTGATAGTAACGATCTACTTCAARGC 
W GCTCGTGCTTGAAAAACTTTAATCCAGTGGCTGGCTAAGCACCAGAAGTCAACTATTAC 
g TATTGTTATCGCTGCAGGAGCAAGGTATTTTCTGGCTCTTTCTCTATAAGAAAAACCAC 
U TGTTTCTCCAGTTAGGCTGCATTCATGAGGCCGAAGCAGAAACCCAAGTGCTTTAAAAA 
GCTCGGAGACCGGTGCTGCAGAAACATGAACCCAGTCATCCAGGGCTTTGGTTAAAGCA 
CAGCCCCCTCGGCAAAGCCCACCTTTCATTTCCTTCCCTTCCATTTCGGGAAAGCCAAG 
CTCCCCCTACGCGTTCGGTTATCTTATTTTCTTGCCTCGCCAGGTCGCTGGCCTGTCTC 
TGGCTTCTCTCCGCGACCTCTCGGTCGTGCAGGCTCCGCGGCAGCGAAGCGGCTGGGGC 
TGGCGCCTCGCTTCCTCAGCGCTCCATTCTTCCCCTCGGCTCCCGCGGGCCGCAGCCGC 
CTTCCGCAGCCGGGGTTCCCGCCGGGATTGACGCGCTGGGGGAGGAGCGGTTTCTCGTT 
GCGCGCCTCTAAGGAACATTACGGCAGGGCTCGTTCCTGGCTCCGGCCGCCAGCCCCAG 
CCTCCCAGGTCCGGAGCCCGGACTGGCGGAGGCCGCGAGGGAGGGAGCACGAGCGAGGA 
GGCACGCGCCCGCCGGTCCGCACCCGGCCCGCCGCAGCCGCCGCCGC CGTCGCCGTCGC 
CGCAGCAGCCATGGCCGAGCGCCGCGCCTTCGCCCAGAAGATCAGCAG 



B 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 

hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCIASP2 
hCLASP? 
hCLASPl 



MFPMEDISISVIGRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 4 1 

MLLFPYDDFQTAILRRQGRYICS 23 

-MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 43 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



N 



STVPEDAEKRAQSLFVKECIKTYSTDWHWNYK 53 

, _ PPT 19 

ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 83 

TVPAKAEEEAQSLFVTECIKTYNSDWHLVNYK 55 

Q VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 79 

DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLLVKEACKFYSSQWHVVNYK 120 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP? 
hQASPl 



YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED- 

DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR 

PDD I EWy S PRDCRTLVS - AVPEE - S EMDPHVR 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEViDKDED- 
ADDLELLLQPRECRTTEP-GIPKD-EKLDAQVR 



•SSSLCSQKGGVIKQG 105 

DCVQTYIREWLI 63 

DCIRSYTEDWAI 126 

'AASLGSQKGGITKHG 107 
- — AAVEMYIEDWVI 122 



YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 180 



• * 



h.CtASP4 
tvitASPS 

K(:Lasp3 

h&LASP2 
h<kiASP7 
Hei-ASPl 



hS^LASPS 
HCLASP3 
liCLASP2 
lle'LASP7 
h^LASPl 



WLHKANVNSTIT — VTMKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESK-GCIYLDACI 1 62 
VNRKNQGS PE I C — G FKKTGS RKDFHKT - LPKQT FESETLECSEPAAQA — GPRHLNVLC 118 
VIRKYHKLGTGF — NPNTLDKQKERQKG-LPKQVFESDEAPDGNSYQDDQDDLKRRSMSI 183 
WLYKGNMNSAIS— VTMRSFKRRFFHLIQLGDGS YNLNFYKDEKISKEPK-GS I FLDSCM 1 64 
VHRRYQYLSAAY—SPVTTDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 17 9 
WLYKGNFNSrVNNTVTVRSFKKRYFQLTQLPDNSYIMNFYKDEKISKEPK-GCIFLDSCT 239 



DWQCPKMRRHAFELKMLDKYSHYIJVAETEQEMEEWLITLKKIIQINTDSLVQEKKETVE 

DVSGKGPVTACDFDLRSLQPDKRLENLLOQVSAEDFEKQNEEARRTN RQAE 

DDTPRGSWACSIFDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 

GWQNNKVRRFAFELKMQDKSSYLIAADSEVEMEEWITILNKILQLN FEAAMQEK 

EDTPRSSGASSIFDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 

GWQNNRLRKYAFELKMNDLTYFVIJU\ETESDMDEWIHTUmiLQISPEGPLQGR^ 



222 
169 
234 
219 
230 
299 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCUtSP7 
hCLASPl 



TAQDDETSS QGKAENIMASLERSMHPEIMCYGRETEQLNKLSRGDGRQNLFSFDSE 278 

LFALYPSVD EEDAVEIRPVPECPKEHLG N -RILVKLLTLKFEIE 212 

LFALHPSPD EEEPIERLSVPDIPKEHFG QRLLVKCLSLKFEIE 277 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

LLTL YPAPD EDEAVERCS RPE PPREHFG QRI LVKCLS LKFE I E 273 

TDLGLDS LDNS VTCECT PEETDS S ENNLHADFAKYLTETEDTVKTTRNMERLNL FS LDP D 3 5 9 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VQRLDFS-— GIEPDIKP^FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

lEPLFAS lALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 268 

lEPIFAS LALYDVKEKKKISENFYFDLNSEQMKGLLRPHVPPAAITTLARSAIFSI 333 

AQKLDFS SAEPEVKS-FEEKFGKRILVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 327 

lEPIFGI LALYDVREKKKISENFYFDLNSDSMKGLLRAHGTHPAISTLARSAIFSV 329 

IDTLKLQKKDLLEPESVIKPFEEKAAKRIMI ICKALNSNLQGCVTENENDPITNIEPFFV 419 
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hCLASP4 
hCLASPS 
hCLASP3 
hCIASP2 
hCLASP? 
hCLASPl 





NIJa.FDVKNNCKISADFmT)I^PPSVREMLWGSSTQIASDGSP KGSSPESYIHGIAE 390 

TYPSSDIYL^/VKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 317 

TYPSQDVFLVIKLEKVLQQGD IGECAEPYMI FKEADA TKNKE-KLEKLKS 382 

TLSLFDIKY^^lKISAD^^VDLNHFSVRQMLATTSPALMNGS GQSPSVLKGILHE 381 

TYPS PDI FLVIKLEKVXQQGD ISECCEPYMVLKEVDT AKNKE-KLEKLRL 378 

SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 479 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SQLRYIQQGIFSyTNPHPEIFLVARIEK\^QGNITHCAEPYIKNSDPVKTAQK\mRT;^^ 450 

QAESFCQR LGKYRMPFAWAPISLSSFFNVSTLEREVTDVDSWGRSPVGERRTLA 372 

QADQFCQR LGKYRMPFAWTAIHLMNIVSSAGSLERDSTEVEISTGERKGSWSERR 437 

AAMQYPKQGIFSVTCPHPDIFLVARIEKVLQGSITHCAEPYMKSSDSSKVAQKVIJa^ 441 

AAEQFCTR LGRYRMPFAWTAVHLANIVSSAGQLDRDSD SEGERRPAWTDRR 429 

EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNIASGAEPYIKNPDSNKYAQKILKSNRQ 539 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



heLASP4 
hClASPS 
hbLASP3 
hbLASP2 
hl!ILASP7 
hC^ASPl 

hC^*ASP4 
hfe^LASPS 
H<3JVSP3 
HCLASP2 

hiS^LASPl 

hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VCSRLGQYRMPFAWAARPI 
QSRRLSERALSLEENGVGSNFTCTS 
NSSIVGRRSLERTTSGDDACNLTSFR-I 



FKDTQGSLDLDGF FSPLYKQDSSKLSSEDILKLLSEYKKPE 



TLS VSSFFKQEGDRLSDEDLFKFLADYKRSS 

PATLtl VTNFFKQEGDRLSDEDLYKFLADMRRPS 



acqrlgqyrmp eawaartl fkdas gnldknaf fsai yrqdsnklsnddmlklladfrkpe 

rrgpq — drassgddacsfsgfr-patltvtnffkqeaerlsdedlfkflamrrps 

fcsklgkyrrafawavrsvfkdnqgnvt)rdsf|fsplfrqesskistedlvklvsdyrrad 



510 
427 
496 
501 
483 
599 



--KTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFE-KNCQNITVEVEEFVPEMTKY 567 
S LQRRVKS I PGLLRLE I S TAPE 1 1 NCCLT PEMLPVKP FP- ENRTRPHKE I LE FP~ TRE V 484 

SVLRRLRPITAQLKIDISPAPENPHYCLTPELLQVKLYP-DSRVRPTREILEFP—ARDV 553 
K-MAKLPVILGNLDITIDNVSSDFPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKH 560 
SLLRRLRPVTAQLKIDISPAPENPHFCLSPELLHIKPYP-DPRGRPTKEILEFP—AREV 540 
R-ISKMQTIPGSLDIAVT)rfVPLEHPNCVTSSFIPVKPFNMMAQTEPTVEVEEFVYDSTKY 658 



CY pftiyknhlyvyplqlkydsqktfakarniavcvefrdsdesdasalkciygkp agsv 

YVPHTVYRNLLYVYPQRLNFVN--KLASARNITIKIQFMCG-EDAS1«MPVIFGKS 3GPE 
YVPNTTYRNLLYIYPQSIJ^FAN--RQGSARNITVKVQn4YG-EDPSNAMPVIFGKS5CSE 
TC PYTI YTNHLYVYPKYLKYDSQKS FAKARNIAICIEFKDSDEEDSQPLKCI YGRP 3GPV 
YA PHTSYRNLLYVYPHSLNFSS — RQGSVRNLAVRVQYMTG-EDPSQALPVI FGKS SCSE 

CRPYRVYKNQIYIYPKHLKYDSQKCFNKARNITVCIEFKNSDEESAKPLKCIYGKP sgpl 



627 
541 
610 
620 
597 
718 



FTT^ AYAWSHHNQNPEFYDEIKIELPIHIJiQKHHLLFTrYHVSCi INTKGTTKKQDfVE 687 

FLQEVYTAVTYHNKSPDFYEEVKIKLPAKLTVNHHLLFTFYHISC^O KQGAiVE 595 

fsk^aytavvyhnrspdfheeikvklpatltdhhhllftfyhvscJq kqnt3le 664 

HE 680 

FTR^FTPWYHNKS PE FYEE FKLHLPACVTENHHLLFT rYHVSC<JP RPGTitLE 651 

iE 778 



• • • 



TPVGFAWVPLLKDGRI ITFEQQLPVS ?^NLPPGYLm*NnAESRRQCNVrtKWVDGAKPLLK 
TLLGYSWLPILLNERLQTGSYCLPVA LEKLPPNYSMHSAEKVPLQNPP IKWAEGHKGVFN 

TPVGYTWIPMLQNGRLKTGQFCLPVS LEKPPQAYSVLSPEA^ LPG ^«WVDNHKGVFN 

TQVGYSWLPLLKDGRWTSEQHIPVS ANLPSGHLGYQELGMGRHYGPE IKWVDGGKPLLK 

TPVGFTWIPLLQHGRLRTGPFCLPVS VDQPPPSYSVLTPDVA LPG ^WVDGHKGVFS 

TSVGYAWLPLMKHDQIASQEYNIPIA TSLPPNYLSFQDSASGKHGGSE IKWVDGGKPLHC 



747 
655 
721 
740 
708 
838 
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'hCLAS'P4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASPl 
hCLASPl 





FKSHLESTIYT(3^pr/HKFFHHCQLIQS] GSKEVlPK.IKyLKCLHAM 794 

I EVQAVS S VHTQDNHLEKFFTLCHS LES QVT FP I RVLDQKI S EMALEHELKLS 1 1 CLNS S 715 
VE WAVS SIHTQDPYLDK FFAL VNALDE H - L FPVR I G DMR IMENNLENE LKS S I S ALNS S 780 

ISTHLVSTVYTQDQHLHNFFQYCQKTES GAQALGNEL^/KYLKSLHAM 787 

VELTAVSSVHPQDPYLDKFFTLVHVLEE 3-AFPFRLKDTVLSEGNVEQELRASLAALRLA 767 
VSTFWSTVNTQDPHVNAFFQECQKREKD MSQSPTSNFIRSCKNLLNVE 887 



E llQVMI QFLPVI LMQLF R 



SP EPLVAFSHHVLDKLV RLVIRPPI 



K ipAIMS FLP 1 1 LNQL^tK- 



VLTNMTH- 



EDDVP 824 
775 

IINRLHKNLEGNHDQHG 840 

VLT-RAT QEEVA 816 

ISGQIVNLGRGAFEAMAHWSLVHRSLEAAQDARG 827 
VLVQNE EDEIT 916 



PLVLFLHLVLDKLFQLSVQPMVIAGQTANFSQFAFESWAIANSLHNSKDLSKDQHG 
Qlp P WR FLHLLLDKL I LLV I RP PV I AGQ I VNLGQAS FEAMAS 
EG KVMIAFLPTILNQLE R 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSS 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 

VNVTRV- 1 IHWAQCHEEGLES HLRS YVKYA YKAEPY 

HCPQLAAYVHYAFRLPGTE PSLPDGAPP VTVQAATLARGS GRPASLYLARSKS I S S 

TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 



860 
835 
899 
852 
883 
952 



Hi!3ASP4 
hSJASPS 
hGLASP3 
nCJLASP2 
hgLASP7 
l^i^LASPl 



SAPQAQLIH — ETLATTMIAILKQS r~ 883 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA 882 

SNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASEYKTVH EELTKSMTTILKPS 875 

SNPDLAVAPGSVDDEVSRILASKLLHEELA-LQ 915 

KE— RPVH EDLAKNVTGLLKSN 972 



liCLASP4 
IfCLASPS 
lkeLASP3 
SC-LASP2 
&^LASP7 
icLASPl 



t — z 



fiCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



ADFLS INKLLKYS WFFFEI lAKSM 

APRPASKKHFHEELALQ MWSTGHVKSM 

NRMSSHTETSSFLQTLTGRLPTKKLFHEELALQWWCSGSVRESALQQAWFFFELMVKSM 

ADFLTSNKLLRYS W FFFDVLI KSM 

S S AVREA I LQHA W F FFQLMVKSM 

: DSPTVKHVLKHS WFFFAI ILKSM 



Cadherin Cleavage 



ATYLLEENKIKLflRGQRrPETYHHVLHSLLLAIIPHVTIRYAEIPDE SRNVmSLAS 

AQHVHNMDKRDS5 RRTR rSDREMDDITTIVNWTSEIAALLVKPQKENEQAEKMNI SLAF 

VHHLYFNDKLEAE RKSR TPERFMDDIAALVSTIASDIVSRFQKDTEM VERLNl SLAF 

AQHLIENSKVKLIRNQR rPASYHHAAETWNMLMPHITQKFGDNPEA SKNANI- SLAV 

ALHLLLGQRLDTi RKLR rPGRFLDDITALVGSVGLEVITRVHKDVEL AEHLN^ SLAF 

AQHLIDTNKIQLf RPQR fPESYQNELDNLVMVXSDHVIWKYKDALEE TRRATt SVAR 



FLKRCLTLMDRG FX FKL INDY I S G FS PKDP 
FLYDLLSLMDRGFVF ^ILIRHYCSQLSAKLSNL 



I^VLAE YKFEFLQT I CNHEHYI PLNL 
TL I SMRLE FLRI LCSHEHYLNLNL 

ICSHEHYVTLNL 



FLNDLLSVMDRGFVF SLIKSCYKQVSSKLYSLPNPS VLVSLRLDFLRI 

FIKRCFTRdDRGFVF <QINNYISCFAPGDP F TLFEYKFEFLRWCNHEHYIPLNL 

FLSDLLSLVDRGFVFSLVRAHYKQVATRLQSSPNP;iALLTLRMEFTRILCSHEHYVTLNL 
FLKRCFTTODRGCVF KMVNNYISMFSSGDL ¥ TLCQYKFDFLQEVCQHEHFIPLCL 



907 

910 

1019 

899 

942 

995 



964 

970 

1076 

956 

999 

1052 



1019 
1027 
1136 
1011 
1059 
1107 



FIG. 8 
3 of 6 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 

hC;&VSP4 
hc£ASP5 
hciASPB 
hdiiASP2 
h^MASP7 
hCHEASPl 



hpLASP4 
h|C0*ASP5 
hCLASPS 
l>cyj^P2 
h£iASP7 
hbLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 





Cadfierln EC motif 



PMAFAKPKLQR VQDS- -NLEYSLSDEYCKHHFLVGI LLRE TS I 10 60 

FFMNADTAPTSP—CPSISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGILFTELAA 1085 
PCS LLT P PAS PS PS VS SAT S QS S G FS TNVQDQKI ANM FELS VP FRQQHYLAGI VLTE LAV 1196 

PMPFGKGRIQR : YQDL—QLDYSLTDEFCRNHFLVGI LLRE VGT 1052 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGILLTELAL 1119 
PIRSANIPDPLTP SES TQELHASDMPEYSVTNEFCRKHFLIGi LLRE VGF 1157 



ALQDN YE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQLYLPFVGLLLENI 2RL 

ALDAE GE G I S F VQRKAVS A I HS LLS S HDLDPRCVKPE VKVK I AAL YL PLVG 1 1 LDAL ? — 
ILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGIIMETV ?~ 

ALQEFR EVRLIAISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENV JRI 

ALEPEIAEGAFILHKKAISAVHSLLCGHDTDPRYAEATVKARVAELYLPLLSIARDTL ?~ 
ALQEDQ I^^mHLALAVLKNIilAKHSFDDRYREPRKQAQIASLYMPLYGMLLDl^ 



1116 
1143 
1254 
1108 
1177 
1213 



AGRDTLYSCA— AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 1160 

QL CDFTVADTRRYRTSGSD 1162 

. QLY DFTETHNQRGRPICIATDD — 1276 

NVRDVSPFPVNAGMTVraESIJaPA-VNPLVTPQKGSTLDNSIJiKDLI^ISGIASP 11 67 

RLH ' DFAEGPGQRSRLASMLDSDTE 1201 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 1273 



FQ- NGHG I KRE DSRGS L I PEGATG FPDQGNTGEN TRQSS TRS S VS QYNRLDQYE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT- SGIVLSSLPYKQYNMLHADT 1208 

YESESGSMISQTVAMAIAGTSVPQLTR — —PGSFLLTSTSGRQHTTFSAES 1324 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 1227 

GEGDIAGTINPSVAMAIAGGPLAPGSR AS ISQGPPTASRAGCALSAES 1249 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 

• • • 



RSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIAR /HDA 

RNI^ICFLWIMKKADQSLIRKWIA£)LPSTQLNRILDLLFICVLCFEYKGKQSSDK/STQ 
RS L L I C LLWVLKNADETVLQKW FTDLS VLQLNRLLDLL YLCVS C FE YKGKKV FER WS L 

3|KSLLMCFLYILKSMSDDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAR sIQEG 
RTLLACVLWVLKNTEPALLQRWATDLTLPQLGRLLDLLYLCIAAFEYKGKKAFEFtNSL 
TIRS LLMC FLH IMKT I S YETL I AYWQRAPS PE VS D FFS I LDVCLQN FRYLGKRNI 



* « 



IR <LAA 



1273 
1268 
1384 
1287 
1309 
1387 



WLSKHFGIDR— 
VLQKSRDVKAR- 
TFKKSKDMRAK- 
LGPIVHDRKS— 
TFKKSLDMKAR- 



-KSQTMPALRNRSGVMQARLQHLSSLESS 1311 

LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 1311 

LEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQ 1430 

QTLPVSRNRTGMMHARLQQLGSLDNS 1323 

LEEAILGTIGARQEMVRRSRERSPFGNPEN 1350 



AFKFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPI IRGKK 1442 



FTLNHSSTTTEADIFHQALLEGNTATEVSLTVLDTISFftTQCFKTQLL 



1359 

— LRWKKEQTHWRQANEKLDKTK^ELDQEALISGNLATEAHLIILDMQENltlQASS-ALD 1368 

;-VTE 1489 
1371 

IKQTVM-LSE 1407 

1500 



ENLRWRKDMTHWRQNTEKLDKSR; EIEHEALIDGNLATEANLI ILDTLEIV /QTVS 

LTFNHSYGHSD;jDVLHQSLLEANIATEVCLTALDTLSLl|rLAFKNQLL 
— VRWRKSVTHWKQTSDRVDKTKI EMEHEALVEGNLATEASLWLDTLEI 

—ALSNPKLLQMLDNTMTSNSNElDIVHHVDTEANIATEGCLTILDLVSLFtTQTHQRQLQ 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 





^mIX;HNPIi^KKVFDIHIAFLKNGQSEVSLKHVFASLRAFISKFPSAFFlCG 1419 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCH 1425 

SKES ILGGVLKVLLHSMACNQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCL 154 6 

ADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIYKFPSTFYEGRAI^IC^^ 1431 

ARES VLGAVLKWLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 

QCDCQNSUlKRGFDTYMLFFQVNQSATAIJaiVFASLRLFVCKFPSAFFQGPADLCGSFCY 1560 



•k-k 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



EVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTHLQIIIAVSQLIADVALSGG 1479 
QVLHHCSSSMDVTRSQACATLYLLMR — FSFGATSNFARVKMQVTMSLASLVGRAPDFNE 14 83 
RLLRHCSSSIGTIRSHPSASLYLLMR — QNFEIGNNFARVKMQVPMSLSSLVGTSQNFNE 1604 
EILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLIADWGIGE 1491 
RLLRHCGSRISTIRTHASASLYLIJtm--QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 
EVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKSIVRSHLQLIKAVSQLIADAG-IGG 1619 



• • • • • 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hC;iASP4 
hC;iASP5 
hG]|ASP3 
hGLASP2 
hC3ASP7 
hgLASPl 



hiGLASP4 
NCltASPS 
l^GiASP3 
hjq|aASP2 
h<iLASP7 
h^JASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SRFQESLFIINNFANSDRPMKATAFPAEVKDLTKRIRTVLMATAQWKEHEKDPEMLIDLQ 
EHIJlRSLRTILAYSEEDTAMWTPFPTQVEELLCNIJ^SILYDTVKMREFQEDPEMI^iDI^ 
EFLRRSLKTILTYAEEDLELRETTFPIXlVQDLVF^aJ^MILSDTVK^IKEHQEDPEMLIDLM 
TRFQQSLSIINNCANSDRilKHTSFSSDVKDLTKRIRTVUlATAQ^iKEHEN^ 
EHLRRSLKTILTYAEEmGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 
SRFQHSIAITNNFANGDKC»1KNSNFPAEVKDLTKRIRTV121ATAQMKEHEKDPEML\^ 

• •••• •••••• 

transmembrane 

YSXJUCSYASTPELRKTWLDSMAKIHVKNGdFSEAAMCYVHVAALVAEFllHRKK 
YRIAKSYQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALVAEYI SMLEDH 
YRIAKGYQTSPE-RLTWLQNMAGKHSERS^ HAEAAQCLVHSAALVAEYI SMLEDR 

YSLAKSYASTPELRKTWLDSMARIHVKNGI LSEAAMCYVHVTALVAEYI TRKG — 

YRIARGYQGSPDLRLTWLQNMAGKHAELG^ HAEAAQCMVHAAALVAEYI ALLEDQ 



YS LANS YAS TPELRRTWLESMAK I HARNGI 



LSEAAMC Y I H I AAL I AE YllKRKG YWKVEK I 



1539 
1543 
1664 
1551 
1582 
1679 



1592 
1598 
1718 
1604 
1637 
1739 



LFPNGCSAFKKITPNIDEEGAMKEDAGMMD 1622 

SYLPVGSVSFQNISSNVLEESWSEDTLSPDEDGV 1 633 

KYLPVGCVTFQNISSNVLEESAVSDDWSPDEEGI 1753 

VFRQGCTAFRVITPNIDEEASMMEDVGMQD 1634 

RHLPVGCVSFQNISSNVLEESAISDDILSPDEEGF 1672 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 1795 



ITAM 



VHYSEEVLLELLEQCVDGLWKAERYE 1 1 SE I SKLI VP I YEKRREFEKLTQV YRTI HG 1679 

CAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPII£AHREFRKLTLTHSK1QR 1 693 

CSGKYFTESGLVGLLEC2AAAS FSMAGMYEAVNEVYKVLI PIHEANRDAKKLST I HGKI QE 1813 

VHFNEDVLMELLEQCADGLWKAERYELIADIYKLIIPIYEKRR 1677 

CSGKHFTELGLVGLLEC2AAGYFTMGGLYEAVNEVYKNLI PILEAHRDYKKLAAV HGKI QE 1732 
TPYNENILVEQLYMCGEFLWKSERYELIADVNKPIIAVFEKQRDFKKLSDLYYDIHR 1852 



ITAM 



DOCK motif DOCK moti f 

^^AftYGQ^FFEEEljGKE^ljYKEijKLTGLSEXSLRLV^ 
.MG flFG- SKFGDLDEQE FV YKEE AI TKLPE I SHRLEAI "YG 



; YTK] LEVMHTKKRLLGT FFR 
? FDS 3 VNKDH — KRM FGT Y FR 

J^FSK ]j VHQS TGWERMFGTjY FRA^G BjYG- ^FGDLIIeQE 

E FFEDEEGKEII YKEE KLTPLSEISQRLLKI ,YS 
^[FTKljMHQSSGWERVFGItY FR\JGE|YG-^ HFGDLE EQE E \ YKEE S I TKIJ^ I S^ 
YLK\'AEVVNSEKRLFGP YYR\J AF YGQC FFEEEEGKEl EYKEE KLTGLSEISQRLLKl TA 



* « « 4 



ITAM 

1739 
1750 
1872 
1710 
1791 
1912 
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IT AM 




ITAM 



EK FGTENVKI I QDSDICVNAKELDP? YAH IQ\. T YVKl Y FDDKELTERKTE FERNHNI SRrV 



QC FGAE EVEVI KDST PVDKTKLDP> KAY I Ql 
EB FGEDWEVI KDSNPVDKCKLDPh KAY IQI 
DK FGSENVKM I QDS GKVNPKDLDS I YAY I Q\ 
EBFGDDWEIIKDSYPVDKSKLDSC KAYIQI 
DK FGADNVK 1 1 QDS NKVNPKDL D P f YAY I Q\ 



* * 




TFVEE YFDEYEMKDRVTYFEKNFNLRRFM 



TYVEE 



TYVEE 



Y FDT YEMKDR I T Y FDKNYNLRRFM 



THVIJ FFDEKELQERKTEFERSHNIRRFM 



Y FDT YELKDRVT Y FDRNYGLRT FL 



itYVra FFEEKE lEDRKTDFEMHHNI NRFV 

t it -k * * ' • * • 

t • • • • > 

DCX:K motif 



ITAM 



FETyPYTLSGKKQGCIEEQCKRRTILTTSNSFPYVra RIPINCEQQI> LKPIDGATDEIKD 
YTTPFTI^GRPRGELHEQYRRNTVLTThIHAFFYIKlRISVIQKEEF\LTPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFP YIKI RVNVTHKEEi: LTPIEVAIEDMQK 
FEMP FTQTGKRQGGVEEQCKRRT I LTAI HC FP fVKt R I PVMYQHHT I LNP I EVAI OEMS K 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFP YlKl RIRVCHREET\ "LTPVEVAIEDMQK 
FETPFTLSGKKHGGVAEQCKRRTILTTSHLFPrVKI'RIQVISQSSTE LNPIEVAIDEMSR 
: ♦:* ^: :* : ** :♦•*:*:: : ** [*:* J *; : . ; * ; ; ; . 

Coiled-coil 



KTAELQKLCSSTDVDMIQLQLKLQC WVSVQVNAGPLAYARAFLNDSQASKYPPKKVSELK 
KTLQLAVAINQEPPDAKMLQWVLQC SVGATVNQGPLEVAQVFLAE I PADPKLYRHHNKLR 
KT QELAFATHQDPADPKMLC»1VLQC S VGTTVNQGPLEVAQVFLSE I PSDPKLFRHHNKLR 
KVAELRQLCSSAEVrWIKLQLKLQC SVSVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLK 
KTRELAFATEQDPPDAKMLQMVLQC SVGPTVNQGPLEVAQVFLAEIPEDPKLFRHHNKLR 
KVSELNQLCTMEEVDMISLQLKLQC SVSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 



Coiled-coil 



DMFRKFIQACS: ALELNERLIKEDQVEYHEGLKSNFRDMVKELSDI IHEQILQEDTMHSP 
LC FKEFIMRCGI AVEKNKRL I TADQRE YQQELKKNYNKLkENLRP^ I ERKI PELYKP I FR 

LCFKDFTKRCEI )ALRKNKSLIGPVQKEYQRELGKLSSP~ . — 

EVFRQFV^CG< >AIAVNERLIKErX3LEYQEEMKANYREMAKELSEI ^HEQICPLEEKTS- 
LCFKDFCKKCEI >ALRKNKALIGPDQKEYHRELERNYCRLREALQPL LTQRLPQLMAPTP- 
EI FRQFADACGOALDVNERLIKEDQLEYQEELRSHYKDMLSELSTV WEQITGRDDLSKR 



1799 
18ia 
1932 
1770 
1851 
1972 



1859 
1870 
1992 
1830 
1911 
2032 



1919 
1930 
2052 
1890 
1971 
2092 



1979 
199^0 
2090 
1949 
2030 
2152 



PD Z lioa nd 
WMSNTLHVFCAISGTSSDRGYGSP riYAEVf — 
VESQKRDSFHRSSFRKCETQLSQGS 



2008 
2015 



VLPNSLHIFNAISGTPTSTKVHGMTS jSSWl 1980 
— PGLRNSLNRASFRKADL — 2047 

GVDQTCTRVISKATPALPTVSIS ^A£V4 — 2180 
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